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1  Introduction 
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The  connections  between  logic  and  computer  science  are  growing  rapidly  and  are 
becoming  deeper.  Besides  theorem  proving,  logic  programming,  and  program  specifi¬ 
cation  and  verification,  other  areas  showing  a  fascinating  mutual  interaction  with  logic 
include  type  theory,  concurrency,  artificial  intelligence,  complexity  theory,  databases, 
operational  semantics,  and  compiler  techniques.  The  concepts  presented  in  this  paper 
are  motivated  by  the  need  to  understand  and  relate  the  many  different  logics  currently 
being  used  in  computer  science,  and  by  the  related  need  for  new  approaches  to  the 
rigorous  design  of  computer  systems.  Logic  programming  is  of  course  one  of  the  areas 
where  logic  and  computer  science  interact  most  strongly.  The  attempt  to  better  un¬ 
derstand  the  nature  of  this  interaction,  as  well  as  its  future  prospects,  motivates  the 
following  basic  question: 

What  is  Logic  Programming ? 

This  paper  tries  to  make  precise  the  meaning  of  this  question,  and  to  answer  it  in 
terms  of  general  axioms  which  apply  to  a  wide  variety  of  different  logics.  In  doing  so, 
we  are  inevitably  led  to  ask  the  more  fundamental  question: 

What  is  a  Logic? 

That  is,  how  should  general  logics  be  axiomatized?  This  is  because  an  axiomatic  notion 
of  logic  programming  must  necessarily  rest  on  an  axiomatic  notion  of  logic  itself.  Most 
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of  the  paper  will  be  devoted  to  the  second  question.  With  an  axiomatic  notion  of  logic 
already  in  place,  it  will  then  answer  the  first. 

Beyond  their  application  to  logic  programming,  the  axioms  for  general  logics  given 
here  are  sufficiently  general  to  have  wide  applicability  within  logic  and  computer  sci¬ 
ence.  Thus,  this  work  has  goals  that  are  in  full  agreement  with  those  of  J.A.  Goguen 
and  R.  Burstall’s  theory  of  institutions  [27,26];  however,  it  addresses  proof-theoretic 
aspects  not  addressed  by  institutions.  In  fact,  institutions  can  be  viewed  as  the  model- 
theoretic  component  of  the  present  theory.  The  main  new  contributions  include  a  gen¬ 
eral  axiomatic  theory  of  entailment  and  proof,  to  cover  the  proof-theoretic  aspects  of 
logic  and  the  many  proof-theoretic  uses  of  logic  in  computer  science;  they  also  include 
new  notions  of  mappings  that  interpret  one  logic  (or  proof  calculus)  in  another,  an 
axiomatic  study  of  categorical  logics ,  and  the  axioms  for  logic  programming. 

Logic  has  of  course  a  long  tradition  of  reflecting  upon  itself.  In  the  process  of  seeking 
the  present  axiomatization,  I  have  reviewed  a  variety  of  previous  axiomatizations  of 
logic.  In  essence,  they  tend  to  fall  within  two  main  approaches: 

•  a  model-theoretic  approach  that  takes  the  satisfaction  relation  between  models 
and  sentences  as  basic,  and 

•  a  proof-theoretic  approach  that  takes  the  entailment  relation  between  sets  of 
sentences  as  basic. 

I  have  found  that  neither  of  these  approaches  is  by  itself  sufficient  to  axiomatize  logic 
programming,  and  that  similar  difficulties  remain  for  many  other  computer  science 
applications.  The  axioms  presented  here  unify  both  approaches,  and  yield  as  one  of 
their  fruits  the  desired  axioms  for  logic  programming. 

The  entailment  relation  asserts  provability,  but  says  nothing  about  how  a  sentence 
is  actually  proved.  To  account  for  proofs  and  for  their  internal  structure,  I  also  present 
a  new  axiomatic  notion  of  proof  calculus  that  is  very  general  and  does  not  favor  any 
particular  proof  theory  style.  The  computer  science  counterpart  of  a  proof  calculus  is 
the  notion  of  an  “operational  semantics.”  The  flexibility  of  the  axioms  given  for  proof 
calculi  permits  putting  them  and  operational  semantics  on  a  common  abstract  basis. 
This  offers  the  possibility  of  a  more  intense  mutual  interaction  between  the  two  fields. 
As  a  fruit  of  this  interaction,  operational  semantics  could  be  placed  on  a  firmer  logical 
basis  and  proof  theory  could  be  enriched  with  new,  more  flexible,  proof  systems.  The 
value  of  establishing  a  closer  link  between  proof  theory  and  operational  semantics  has 
been  recognized  by  many  authors,  and  has  led  to  specific  proposals  such  as  the  one  by 
G.  Plotkin  [57];  it  is  also  emphasized  in  the  recent  work  of  J.-Y.  Girard  [20]. 

The  methods  of  category  theory  have  taught  us  that  the  crucial  mathematical 
properties  of  a  given  subject  do  not  reside  in  the  structures  in  question,  and  even  less 
in  the  particular  representations  chosen  for  those  structures.  Rather,  they  reside  in 
the  mappings  that  preserve  those  structures.  In  our  present  case,  the  structures  are 
logics  themselves  and  the  mappings  should  be  natural  ways  of  interpreting  one  logic,  or 
one  proof  calculus,  into  another.  Although  instances  of  such  interpretations  abound 
in  logic,  and  are  of  great  practical  importance  in  computer  science,  most  existing 
axiomatic  treatments  of  general  logics,  with  the  exception  of  Goguen  and  Burstall’s 
theory  of  institutions  [27,26]  and  work  related  to  institutions,  do  not  include  a  formal 
treatment  of  such  mappings.  In  this  paper,  mappings  of  this  sort  play  a  central 
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role.  New  general  notions  of  mapping  between  entailment  systems,  between  logics, 
and  between  proof  systems,  are  given  and  illustrated  with  examples.  The  mappings 
in  question  are  far  more  general  than  syntactic  mappings  translating  languages  in 
one  logic  into  languages  in  another;  they  can  also  map  languages  into  theories,  or 
even  perform  theory  transformations  of  a  more  general  nature.  In  addition,  they  also 
provide  a  systematic  way  of  relating  models  in  different  logics.  Such  notions  of  mapping 
should  have  wide  applicability  in  logic.  They  also  have  many  potential  applications  in 
computer  science. 

Axiomatic  treatments  such  as  “abstract  model  theory”  usually  come  with  a  built-in 
notion  of  structure,  such  as  a  set-theoretic  structure  with  functions,  relations,  etc.  Al¬ 
though  very  useful  within  their  own  scope,  such  approaches  lack  the  flexibility  needed 
to  deal  with  logics  such  as  nonclassical  and  higher  order  logics,  for  which  the  appropri¬ 
ate  notion  of  “model”  may  be  widely  different  from  traditional  set-theoretic  structures. 
For  many  of  these  nonstandard  logics  the  categorical  approach  initiated  by  F.W.  Law- 
vere  [45,46]  is  ideally  suited.  In  particular,  there  is  at  present  a  thriving  interest  in 
categorical  logics  in  connection  with  applications  of  constructive  type  theory  to  pro¬ 
gramming  languages  and  to  concurrent  systems.  To  demonstrate  the  flexibility  of  the 
axioms  proposed  in  this  paper,  I  give  a  general  definition  of  categorical  logic,  discuss 
examples,  study  their  main  properties,  and  show  how  any  such  logic  is  a  particular 
instance  of  a  logic  in  the  sense  proposed  here.  I  reexamine  this  topic  in  connection 
with  the  axioms  for  logic  programming,  in  order  to  show  how  in  this  way  functional 
programming  languages  based  on  type  theory  can  be  naturally  unified  with  logic  pro¬ 
gramming. 

The  main  purpose  of  this  paper  is  to  propose  new  concepts  that  I  believe  can  be 
useful  in  many  areas  of  computer  science  and  also  in  logic.  Given  this  purpose,  there  me 
more  definitions  and  examples  than  there  are  theorems;  this  situation  should  of  course 
be  reversed  in  the  future.  Indeed,  the  paper  as  a  whole  is  in  a  sense  more  a  promise  of 
things  to  come  than  an  actual  fulfillment  of  that  promise.  The  concept  of  a  logic  as  a 
harmonious  relationship  between  entailment  and  satisfaction  is  particularly  simple  and, 
once  the  observation  is  made,  seems  the  obvious  thing  to  do.  The  concept  of  a  proof 
calculus  is  less  obvious  and  is  perhaps  one  of  the  main  new  contributions;  I  believe  that 
for  computer  science  applications  one  needs  to  have  great  flexibility  about  what  counts 
as  a  “proof;”  therefore,  I  have  avoided  making  any  commitments  to  particular  proof 
theory  styles  in  the  proof  calculus  axioms.  Many  future  computer  science  applications 
of  these  ideas  will  heavily  use  mappings  between  logics,  between  proof  calculi,  etc.  I 
believe  that  mappings  may  prove  to  be  the  most  important  concept.  They  play  a  key 
role  in  relating  different  logics  and  different  computer  systems,  and  such  relationships 
are  conceptually  and  practically  very  important.  Also,  experience  with  them  may 
suggest  new  methods  for  the  rigorous  design  and  development  of  computer  systems. 
Two  topics  with  more  particular  scopes  but  still  quite  important  axe  categorical  logics 
— specially  in  their  applications  to  type  theory  and  to  concurrency —  and  the  axioms 
for  logic  programming,  because  of  the  new  possibilities  for  language  design  that  they 
suggest. 

The  paper  assumes  an  acquaintance  with  elementary  concepts  of  category  theory, 
such  as:  category,  subcategory,  functor,  natural  transformation  and  natural  equiva¬ 
lence,  horizontal  and  vertical  composition  of  natural  transformations,  pullbacks  and 
pushouts,  and  adjoint  functors.  All  the  relevant  concepts  axe  clearly  explained  in  Mac 
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Lane’s  book  [47]. 


1.1  General  Logics 

As  already  mentioned,  one  can  speak  of  two  approaches  to  the  axiomatization  of 
general  logics,  a  model-theoretic  approach  that  focuses  on  the  satisfaction  relation 

M  |=  <p 

between  a  model  M  and  a  sentence  <pt  and  a  proof-theoretic  approach  that  seeks  to 
asdomatize  the  entailment  relation 

ri-v? 

between  a  set  of  sentences  V  and  a  sentence  <p  derivable  from  T.  The  model-theoretic 
approach  is  exemplified  by  Barwise’s  axioms  for  abstract  model  theory  [3]  (see  also 

[16]  and  the  volume  [2]  for  a  good  survey  of  this  field).  The  framework  of  institutions 
of  Goguen  and  Burstall  [27,26]  also  belongs  to  this  model-theoretic  approach,  but  it 
achieves  much  greater  generality  by  using  category  theory  and  avoiding  a  commitment 
to  particular  notions  of  “language”  and  “structure.”  The  proof-theoretic1  approach 
has  a  long  tradition  dating  back  to  work  of  Tarski  [69]  on  “consequence  relations”  and 
of  Hertz  and  Gentzen  on  the  entailment  relation  K  This  approach  owes  much  to  the 
work  of  Dana  Scott  [62]  and  other  authors.  The  recent  work  of  Fiadeiro  and  Semades 

[17]  belongs  to  this  proof-theoretic  tradition,  but  uses  methods  from  the  theory  of 
institutions;  however,  that  work  does  not  propose  a  common  axiomatization  connecting 
model  theory  and  proof  theory.  In  my  attempt  to  axiomatize  logic  programming 
languages,  I  have  found  that  neither  of  these  two  approaches  is  enough  by  itself  to  yield 
a  satisfactory  axiomatization.  This  paper  proposes  a  unified  approach  that  integrates 
model- theoretic  and  proof-theoretic  aspects  into  a  single  axiomatization  of  a  logic. 
The  axiomatization  in  question  is  quite  simple.  It  consists  of  an  “entailment  system” , 
specifying  an  entailment  relation  h,  together  with  a  “satisfaction  system”  (specifically, 
an  institution  in  the  Goguen-Burstall  sense)  specifying  a  satisfaction  relation  f=.  The 
entailment  and  satisfaction  relations  are  then  linked  by  a  soundness  axiom.  Therefore, 
institutions  provide  the  model-theoretic  component  of  a  logic  in  the  precise  sense  given 
to  this  term  in  this  paper.  As  done  in  the  theory  of  institutions,  the  axioms  for  a  logic 
are  expressed  in  a  categorical  way,  to  avoid  building  in  particular  choices  of  structures 
or  languages.  As  a  consequence,  they  are  very  f  uersl. 

1.2  Proof  Calculi 

The  entailment  relation  h  says  nothing  about  the  internal  structure  of  a  proof.  To  have 
a  satisfactory  account  of  proofs,  we  need  the  additional  concept  of  a  “proof  calculus” 
P  for  a  logic  Z.  The  definition  of  a  proof  calculus  is  very  general,  and  does  not  favor 
any  particular  style  of  proof  theory.  The  same  logic  may  of  course  have  many  different 
proof  calculi.  Therefore,  when  wishing  to  include  a  specific  proof  calculus  as  part  of 
a  logic,  the  resulting  logic  plus  proof  calculus  is  instead  called  a  logical  system.  The 
axioms  for  a  proof  calculus  P  state  that  each  language  (also  called  a  signature)  in 
the  logic  Z  has  an  associated  space  of  proofs,  which  is  an  object  of  an  appropriate 

lTlus  is  an  oversimplification,  since  semantic  considerations  are  also  included  in  it. 
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category.  From  such  a  space  we  can  then  extract  the  actual  set  of  proofs  supporting  a 
given  entailment 

We  actually  need  the  more  general  concept  of  a  “proof  subcalculus”  where  proofs 
are  restricted  to  some  given  class  of  axioms  and  conclusions  are  also  restricted  to  some 
given  class  of  sentences.  It  is  by  systematically  exploiting  such  restrictions  that  the 
structure  of  proofs  can  be  simplified;  for  example,  we  can  in  this  way  arrive  at  proof 
theories  that  are  efficient  and  can  be  used  in  practice  as  the  operational  semantics  of  a 
logic  programming  language.  For  programming  and  other  purposes  we  need  not  just 
proof  subcalculi,  but  effective  ones. 

1.3  Relating  Logics 

Interpretations  mapping  one  logic,  or  one  proof  theory,  into  another  are  very  common 
in  logic.  They  permit  the  transfer  of  results  between  different  logics  and  serve  to 
provide  a  relative  foundation  for  a  logical  system  by  reducing  it  to  a  simpler  one. 
In  computer  science  there  are  compelling  practical  reasons  for  establishing  similar 
mappings.  Even  if  not  explicitly  recognized  as  such,  the  need  for  mappings  of  this  sort 
manifests  itself  in  a  variety  of  ways.  For  example,  for  programming  languages  it  may 
take  the  form  of  trying  to  combine  language  features  from  different  languages  based 
on  different  logics,  or  of  trying  to  compile  an  inefficient  operational  semantics  into  an 
efficient  one.  Since  the  need  is  there,  like  a  river  that  nobody  can  stop,  something  of 
the  sort  will  be  done.  However,  if  the  approach  taken  lacks  a  logical  basis  to  serve 
as  a  criterion  for  correctness  the  result  may  be  quite  ad  hoc  and  unsatisfactory,  it 
will  probably  involve  a  good  deal  of  costly  engineering  trial  and  error,  and  may  in 
the  end  lack  a  clear  intellectual  content,  making  it  impossible  to  be  transmitted  as  a 
lasting  scientific  contribution.  Also,  since  the  development  of  computer  systems  is  a 
very  expensive  and  labor  intensive  activity,  there  is  a  great  need  for  reusing  systems  as 
much  as  possible.  One  may  for  example  want  to  use  a  theorem  prover  in  one  logic  to 
prove  theorems  in  a  different  logic,  and  the  soundness  of  such  a  procedure  must  then 
be  justified  by  an  appropriate  map  of  logics. 

This  work  proposes  a  general  axiomatic  theory  of  mappings  that  is  flexible  enough 
to  encompass  and  unify  the  maps  used  in  logic  and  the  maps  needed  in  computer  sci¬ 
ence.  The  most  promising  computer  science  application  of  this  theory  is  probably  not 
to  the  a  posteriori  justification  of  computer  science  practice,  but  to  the  development 
of  new  methods  for  the  rigorous  design  of  computer  systems.  For  the  model-theoretic 
component,  the  maps  are  similar  to  the  institution  morphisms  of  Gogue-.  and  Burstall 
[27,26],  which  provided  the  original  inspiration.  In  addition,  the  following  other  types 
of  mappings  are  defined: 

•  Maps  of  entailment  systems,  relating  languages  in  different  logics  and  preserving 
the  entailment  relation. 

•  Maps  of  proof  calculi,  which  in  addition  relate  proofs  in  different  logics. 

•  Maps  of  logics,  that  map  their  underlying  entailment  systems  and  their  under¬ 
lying  institutions. 

•  Maps  of  logical  systems,  that  map  botn  logics  and  their  corresponding  proof 
calculi. 
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As  already  mentioned,  these  mappings  permit  very  general  translations.  They  are  not 
restricted  to  translating  one  language  into  another,  but  allow  complex  transformations 
between  theories  in  different  logics  that  also  occur  in  practice. 

1.4  Categorical  Logics 

Axiomatizing  a  notion  always  presents  a  dilemma.  In  trying  to  achieve  generality,  an 
axiomatization  can  become  too  weak  and  somewhat  vacuous  in  its  results.  Yet,  if  the 
axiomatization  is  too  specific,  it  will  fail  to  include  relevant  examples  and,  furthermore, 
it  may  hinder  subsequent  developments.  The  goal  of  Section  5  is  to  show  by  example 
that  the  proposed  notion  of  logic  has,  indeed,  a  wide  applicability.  I  axiomatize  a 
large  and  important  class  of  logics,  namely  categorical  logics,  as  particular  instances 
of  the  general  notion  of  logic  proposed  in  this  paper,  show  that  they  have  remarkably 
nice  model-theoretic  properties,  and  prove  a  general  structure-semantics  adjointness 
theorem. 

The  basic  insight  provided  by  categorical  logic  in  the  sense  of  Lawvere  is  that  the 
essential  aspects  of  a  logic,  independently  of  particular  syntactic  choices  that  must  be 
made  to  talk  about  it,  are  categorical  properties.  For  example,  the  key  categorical 
property  behind  the  linguistic  notion  of  an  equation  is  the  existence  of  finite  products. 
Semantics  can  then  be  made  functorial,  by  reinterpreting  theories  as  categories  with 
appropriate  structure  and  viewing  models  as  functors  that  preserve  that  structure. 

1.5  Axioms  for  Logic  Programming 

The  logic  programing  dream  has  only  begun  to  be  realized.  So  far,  the  overwhelming 
majority  of  work  in  this  area  has  dealt  with  the  Horn  clause  fragment  of  first  order  logic 
but  the  idea  is  obviously  much  broader,  and  its  potential  in  other  contexts  remains 
to  a  good  extent  unexploited.  This  paper  tries  to  contribute  to  a  scientific  discussion 
of  logic  programming  concepts  in  as  broad  a  context  as  possible.  It  does  so  by  using 
the  concepts  proposed  for  general  logics  as  the  basis  for  an  axiomatic  notion  of  logic 
programming.  The  axioms  proposed  are  a  further  step  in  a  series  of  attempts  to 
make  precise  a  broad  view  of  logic  programming  shared  with  Joseph  Goguen.  This 
view  was  expressed  informally  in  the  joint  paper  [32],  and  formally  by  Joseph  Goguen 
using  the  notion  of  an  institution  [24].  Logic  programming  presents  a  very  interesting 
combination  of  proof  theory  and  model  theory;  the  axioms  that  I  propose  try  to  cover 
and  relate  both  aspects.  I  also  include  a  discussion  of  different  logic  programming 
languages  and  styles.  Within  first  order  logic  there  are  already  several  possibilities 
that  are  quite  different  from  the  traditional  Prolog  style.  A  direct  fruit  of  the  study  of 
categorical  logics  undertaken  in  Section  5  is  the  establishment  of  a  precise  connection 
between  the  very  active  research  area  of  functional  programming  languages  based  on 
constructive  type  theory  and  higher  order  instances  of  logic  programming. 
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way  for  their  kind  encouragement  and  their  important  technical  suggestions  along  the 
maturing  process  that  these  ideas  have  undergone  since  they  were  first  presented  at  the 
Logic  Colloquium  in  Granada.  Joseph  Goguen  provided  also  very  helpful  comments 
to  the  final  draft. 

It  is  a  pleasure  for  me  to  express  my  warmest  thanks  to  F.  William  Lawvere  with 
whom  I  have  discussed  the  ideas  of  this  paper  in  many  conversations;  beyond  his  many 
important  suggestions,  I  owe  to  him  a  crucial  influence  in  my  ways  of  understanding 
logic  and  mathematics  that  goes  back  to  my  years  as  a  graduate  student  and  has  been 
a  permanent  inspiration  ever  since. 

An  earlier  version  benefited  from  very  valuable  suggestions  by  Jon  Barwise,  Keith 
Clark,  Brian  Mayoh,  Gordon  Plotkin,  Johan  van  Benthem  and  an  anonymous  referee 
that  have  led  to  substantial  improvements  and  I  am  indebted  to  them  for  their  kind 
criticism.  Narciso  Martf-Oliet  deserves  special  thanks  for  having  suggested  numerous 
improvements  to  several  drafts  and  for  his  generous  help  with  the  preparation  of  the 
IATjjX  diagrams. 

2  General  Logics 

The  proof-theoretic  and  model-theoretic  approaches  to  general  logics  are  unified  in  the 
following  way.  First,  I  introduce  the  notion  of  an  entailment  system  that  formalizes  the 
provability  relation;  the  axioms  are  similar  to  those  of  Scott  [62],  but  they  also  account 
for  translations  across  different  signatures.  Given  an  entailment  system,  we  can  define 
the  notions  of  a  theory  and  a  theory  morphism.  The  model-theoretic  aspects  of  a  logic 
are  covered  by  Goguen  and  Burstall’s  notion  of  an  institution  [27,26].  The  notion  of  a 
logic  is  then  obtained  by  combining  an  entailment  system  with  an  institution  in  such 
a  way  that  a  soundness  condition  (relating  provability  and  satisfaction)  holds.  The 
logic  is  called  complete  if,  in  addition,  satisfaction  implies  provability.  I  also  show  how 
a  complete  logic  can  naturally  be  associated  with  an  entailment  system  or  with  an 
institution. 
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2.1  Ezxtailment  Systems 

Consider  the  familiar  example  of  first  order  logic.  We  usually  fix  a  particular  signature 
E,  consisting  of  a  pair  (F,  P )  where  F  is  a  ranked  alphabet  of  function  symbols  and 
P  a  ranked  alphabet  of  predicate  symbols.  Relative  to  such  a  signature,  we  have  a 
notion  of  a  sentence .  The  rules  of  first  order  logic  then  specify  for  us  an  entailment 
relation, 

V\-<p 

which  holds  between  a  set  of  sentences  T  and  a  sentence  <p  when  <p  is  derivable  from 
T.  For  the  moment,  we  shall  make  abstraction  of  the  particular  rules  used  to  generate 
the  relation  h  and  concentrate  on  the  relation  itself.  Indeed,  the  entailment  relation 
plays  a  more  central  role,  since  it  remains  the  same  across  the  many  different  proof 
calculi  that  exist  for  first  order  logic. 

The  first  order  entailment  relation  H  satisfies  three  basic  properties  that  can.  more 
generally  be  justified  on  intuitive  grounds  as  properties  that  any  reasonable  entailment 
relation  should  indeed  satisfy.  These  are: 

•  • 

•  reflexivity ,  i.e.,  we  can  always  prove  a  sentence  if  we  can  assume  it; 

•  monotonicity ,  i.e.,  we  can  always  prove  with  more  assumptions  what  we  can 
prove  with  fewer,  and 

•  transitivity ,  i.e.,  using  as  an  additional  assumption  something  already  proved 
should  not  give  us  more  conclusions  than  those  already  entailed  by  our  original 
assumptions. 

We  may  at  times  wish  to  change  the  signature  E  and  move  to  a  new  signature  E\ 
The  signature  E'  may  just  be  an  enlargement  of  our  original  syntax.  In  other  cases, 
E'  may  be  a  genuinely  different  signature  so  that  a  translation  of  the  old  symbols  to 
the  new  ones  must  take  place.  In  any  event,  the  general  form  of  such  translations  H  : 
E  — ♦  E',  called  signature  morphisms ,  consists,  for  first  order  signatures,  of  a  pair  of 
rank-preserving  functions,  one  for  function  symbols  and  another  for  predicate  symbols, 
both  used  for  mapping  the  old  symbols  into  the  new  ones.  Therefore,  signatures  form  a 
category  Sign  with  signatures  as  objects  and  signature  morphisms  as  morphisms.  Any 
signature  morphism  JET  induces  a  corresponding  translation  at  the  level  of  sentences, 
so  that  we  can  associate  to  each  E-sentence  <p  a  corresponding  E'-sentence  H{<p)  by 
systematically  replacing  the  old  symbols  by  the  new  ones,  according  to  the  signature 
morphism  H.  All  this  can  be  expressed  in  a  compact  way  by  saying  that  the  process 
associating  to  each  signature  E  its  set  of  sentences  sen(E)  is  in  fact  a  functor  sen  : 
Sign  — ►  Set  from  the  category  of  signatures  to  the  category  of  sets.  We  of  course 
have  no  problem  accepting  the  fact  that  entailments  are  stable  under  translation  by 
signature  morphisms:  i.e.,  if  we  were  able  to  prove  a  conclusion  from  some  axioms, 
we  are  then  able,  for  any  translation  H,  to  prove  the  translated  conclusion  from  the 
translated  axioms. 

Definition  1  An  entailment  system  is  a  triple  £  =  {Sign,  sen,  h)  with  Sign  a  category 
whose  objects  are  called  signatures,  sen  a  functor3  sen  :  Sign  — *  Set  and  t-  a  function 

3 By  convention,  the  function  sen(H)  associated  by  the  fnnetor  sen  to  a  signature  morphism  H  will 
also  be  denoted  by  H . 
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associating  to  each  E  in  Sign  a  binary  relation  bs  C  P(sen( E))  x  sen(E)  called  E- 
entailment  such  that  the  following  properties  are  satisfied: 

1.  reflexivity.  for  any  <p  €  sen(E),  {y?}  \~z  <p ; 

2.  monotonicity:  if  T  He  <p  and  V  D  T  then  T'  hg  <p\ 

3.  transitivity,  if  T  I~e  <Pi,  for  t  6  I,  and  T  U  {<p,-  \  i  €  /}  bj  ip,  then  T  b^  ip; 

4.  -translation:  if  T  bs  tp,  then  for  any  H  :  E  — *  E'  in  Sign,  fT(r)  bjj#  H{p). 

In  addition,  an  entailment  system  is  called  compact  if  whenever  r  \~z  <p,  then  we 
can  find  a  finite  To  Q  T  such  that  To  bs  <p.  □ 

Remarks: 

1.  The  reflexivity,  monotonicity  and  transitivity  axioms  are  similar  to  those  given 
by  Scott  in  [62].  However,  the  present  formulation  avoids  any  compactness  as¬ 
sumptions. 

2.  The  entailment  relation  b  induces  a  function  mapping  each  set  of  sentences  T  to 
the  set  T*  =  {<p  |  T  b  ip}.  We  call  T*  the  set  of  theorems  provable  from  T.  It 
follows  easily  from  the  reflexivity,  monotonicity  and  transitivity  axioms  that  the 
assignment  r  T*  is  a  closure  function,  i.e.,  it  satisfies: 

(a)  rcr 

(b)  rcr  =>  r*  c  r* 

(c)  r**  =  r* 

Tarski’s  original  axioms  [69]  were  given  in  terms  of  this  closure. 

3.  I  am  indebted  to  Joseph  Goguen  for  pointing  out  to  me  that  the  recent  work  of 

Fiadeiro  and  Semades  on  tt7r-institutions”  [17]  proposes  a  notion  quite  similar 
to  entailment  systems.  Their  notion  is  expressed  in  terms  of  a  closure  operator 
and  includes  a  compactness  assumption.  3 

4.  The  b-translation  axiom  can  be  equivalently  expressed  by  saying  that  b  is  a 
functor  b  :  Sign  — ►  Set,  that  is  a  subfunctor  of  the  functor  mapping  each 
signature  E  to  the  set  P(sen( E))  x  sen( E). 

5.  One  should  keep  in  mind  that  the  entailment  relation  b  is  independent  of  any 
rules  for  its  generation.  Therefore,  the  reflexivity,  monotonicity  and  transitivity 
conditions  should  be  viewed  as  abstract  properties  of  b,  and  should  not  be  con¬ 
fused  with  particular  rules  of  a  specific  proof  calculus  for  generating  b,  so  that, 
say,  reflexivity  would  then  be  understood  as  an  axiom  scheme,  monotonicity  as  a 
weakening  rule,  and  transitivity  as  a  cut  rule.  In  fact,  it  may  very  well  be  the  case 
that  an  entailment  system  satisfying  the  reflexivity,  monotonicity  and  transitiv¬ 
ity  conditions  is  generated  by  a  proof  calculus  that  rejects  most  structural  rules 
and  imposes  restrictions  on  cut.  For  example,  in  linear  logic  [22]  weakening  and 
contraction  are  forbidden  so  that  the  calculus  is  in  a  sense  “nonmonotonic.”  We 
have  the  sequent  A  —>  A  as  an  axiom,  but  we  cannot  derive  either  A,  B  — ►  A  or 
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even  A,  A  — *  A  as  consequences.  The  point  is  that,  for  E  a  linear  logic  signature, 
the  elements  of  sen(E)  should  not  be  identified  with  formulas  but  with  sequents. 
Viewed  as  a  way  of  generating  sequents,  i.e.,  identifying  our  h  with  the  closure  of 
the  horizontal  bar  relation  among  linear  logic  sequents,  the  entailment  of  linear 
logic  is  indeed  reflexive,  monotonic  and  transitive.  The  idea  that  sequents  are 
a  good  choice  for  a  notion  of  sentence  in  linear  logic  is  very  much  in  keeping 
with  Girard’s  intuition  of  a  sequent  A-+  B  as  an  action  from  A  to  B;  it  is  also 
supported  by  recent  work  on  categorical  models  for  linear  logic  [64,14,56]. 

2.2  Theories 

Given  a  signature  E,  a  theory  is  presented  by  a  set  T  of  E-sentences  called  its  axioms. 
We  can  therefore  define  a  theory  as  a  pair  T  =  (E,  T).  For  some  purposes,  one  deals  not 
with  the  original  axioms  T  but  rather  with  their  closure  under  entailment  T*,  so  that  it 
is  tempting  to  call  T  =  (E,T)  a  presentation  if  the  theory  T  =  (E,r*).  However,  the 
view  of  theories  as  presentations  allows  us  to  make  finer  distinctions  that  are  important 
for  both  proof-theoretic  and  computational  purposes.  We  can,  for  example,  distinguish 
between  a  sentence  that  is  a  basic  axiom  and  another  that  is  a  derived  theorem.  Also, 
although  (E,  T)  and  (E,  T*)  are  isomorphic  in  the  general  category  Th  of  theories,  they 
are  not  isomorphic  in  a  more  restricted  but  useful  category  of  axiom-preserving  theory 
morphisms. 

Definition  2  Given  an  entailment  system  i ,  its  category  Th  of  theories  has  as  objects 
pairs  T  =  (E,  T)  with  E  a  signature  and  T  C  sen(E)  and  as  morphisms,  H  :  (E,  T)  — ► 
(E',r),  called  theory  morphisms ,  signature  morphisms  H  :  E  — ►  E'  such  that  if 
<p  €  T,  then  T  H(<p). 

A  theory  morphism  H  :  (E,  T)  — ►  (E',  I”)  is  called  axiom-preserving  iff  it  satisfies 
the  condition  that  H(T)  C  I”.  This  defines  a  subcategory  Thp  with  the  same  objects 
as  Th  but  with  morphisms  restricted  to  be  axiom-preserving  theory  morphisms.  Since 
given  an  arbitrary  theory  morphism  H  :  (E,T)  — ►  (E\r'j  the  theories  (S',  I”)  and 
(E',r'Uff(r))  are  always  isomorphic,  the  restriction  to  Thp  is  not  very  serious.  Notice 
that  the  category  Thp  does  not  depend  at  all  on  the  entailment  relation  h,  i.e.,  any 
other  entailment  system  with  identical  signatures  and  sentences  will  yield  the  same 
The.  □ 


Remarks: 

1.  Projection  to  the  first  component  yields  a  forgetful  functor  sign  :  Th  — ►  Sign  : 
(E,  T)  • — ►  E.  Associating  to  each  signature  E  the  theory  (E,  0)  yields  a  functor  F  : 
Sign  — *  Th  left  adjoint  to  sign ,  i.e.,  such  that  there  is  a  natural  isomorphism 
Ih^OQ.r)  =  SignQ:,sign(T)). 

2.  The  category  Th  ia  equivalent  to  the  full  subcategory  determined  by  theories  of 
the  form  T  =  (E,T*).  Theory  morphisms  H  :  (E,f*)  — ►  (E',H*)  are  always 
axiom-preserving,  i.e.,  they  satisfy:  H(T*)  C  T".  This  corresponds  to  the  dis¬ 
tinction  between  entailment  closed  theories  and  presentations  mentioned  above. 
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3.  By  composing  with  the  forgetful  functor  sign  :  Th  — ►  Sign,  we  can  extend 
the  functor  sen  :  Sign  — ►  Set  to  a  functor  sen  :  Th  — ►  Set,  i.e.,  we  define 
sen(T)  =  sen(sign[T)) . 

4.  The  assignment  to  each  theory  T  =  (E,l?)  of  the  set  T*  of  its  theorems  is  a 
functor  thm  :  Th  — ►  Set.  Indeed,  thm  is  a  subfunctor  of  the  functor  sen  just 
defined  for  theories. 

5.  We  can  extend  the  functor  b  :  Sign  — ►  Set  to  a  functor  b:  Th  — ►  Set  by 
defining  A  H-(e,t)  ^  iff  A  U  T  hs  jj.  The  original  b  :  Sign  — ►  Set  is  then 
recovered  by  composing  with  the  functor  F  :  Sign  — ►  Th. 

2.3  Institutions 

In  first  order  logic,  given  a  sign.  jEwe  associate  to  it  a  category  ModCL).  Its 
objects,  called  11-models  (or  E-structures),  consist  of  a  set  together  with  an  interpreta¬ 
tion  of  each  n-ary  function  symbol  as  an  n-ary  operation  and  of  each  n-ary  predicate 
symbol  as  an  n-ary  predicate.  Its  morphisms  are  functions  that  preserve  the  opera¬ 
tions  and  the  predicates.  Given  a  E-model  M  and  a  E-sentence  <p  we  have  the  notion 
of  satisfaction  of  the  sentence  <p  by  the  model  M,  written  M  |=£  <p. 

A  signature  morphism  H  :  E  — ►  E'  allows  us  to  view  a  E'-model  M'  as  a  E-model 
ff^(M'),  just  by  giving  to  each  function  symbol  /  in  E  the  interpretation  of  H{f)  in  M', 
and  doing  the  same  for  predicate  symbols.  This  extends  trivially  to  homomorphisms, 
so  that  we  have  a  functor  £Tk  :  Mod(H')  — ►  Mod(S).  Globally,  this  means  that  Mod 
is  actually  a  functor  Mod  :  Sign 09  — ♦  Cat,  from  the  dual  of  the  category  of  signatures 
(same  objects,  reversed  direction  for  morphisms)  to  the  category  of  categories,  where 
we  have  adopted  the  notation  ModlH)  =  H* .  It  follows  easily  from  the  definition  of 
the  satisfaction  relation  that  satisfaction  is  invariant  under  the  process  of  changing 
signatures,  i.e., 

H^M1)  (=s  >p  iff  Af  (=s.  H{ip). 

Definition  3  An  institution  [27,26]  is  a  quadruple  I  =  [Sign,  sen.  Mod.  (=)  with  Sign 
a  category  whose  objects  are  called  signatures,  sen  :  Sign  — ►  Set  a  functor  associating 
to  each  signature  a  set  of  sentences,  Mod  :  Sign0*  — ►  Cat  a  functor  associating  to 
each  signature  a  corresponding  category  of  models?,  and  f=  a  function  associating  to 
each  signature  E  a  binary  relation  ]=s  Q  \Mod[ E)|  x  sen(E),  called  satisfaction,  where 
| Mod[H)\  denotes  the  class4  of  all  objects  in  the  category  Mod(L)  in  such  a  way  that 
the  following  property  holds  for  any  M'  £  ModlH').  H  :  E  — ►  E',  <p  £  sen(E): 

|=- invariance :  H^(M')  ^=e  <p  iff  Af  H[p). 


□ 

3  Aa  before,  on  morphia ma  we  adopt  the  notation  ModlH)  =  H* . 

4We  shall  not  worry  about  foundational  iasues  here.  Let  whoever  worries  take  refuge  in  a 
Grothendieck  universe! 


Given  a  set  of  E-sentences  T,  we  define  the  category  Afod(E.r)  as  the  full  sub¬ 
category  of  ModCL)  determined  by  those  models  M  €  Mod(E)  that  satisfy  ail  the 
sentences  in  T,  i.e.,  M  <p  for  each  <p  €  T.  We  can  define  a  relation  between  sets  of 
sentences  and  sentences,  also  denoted  ^=,  as  follows: 

T  [=s  ip  iff  M  (=s  V  for  each  M  €  Mod(E.T). 

The  naturalness  of  the  definition  of  ent ailment  system  given  in  Section  2.1  is  rein¬ 
forced  by  the  fact  that  any  institution  yields  an  entailment  system. 

Proposition  4  For  J  =  [Sign,  sen,  Mod.  ^=)  an  institution,  the  triple  I+  =  (Sign, 
sen,  f=),  with  now  denoting  the  associated  relation  between  sets  of  sentences  and 
sentences,  is  an  entailment  system.  □ 

Of  course,  since  this  entailment  system  has  been  defined  by  entirely  model-theoretic 
methods,  we  should  not  in  general  expect  to  find  an  effective  proof  calculus  to  generate 
it.  However,  for  1  the  first  order  logic  institution,  the  completeness  theorem  assures 
us  that  I+  coincides  with  the  entailment  system  for  first  order  logic  already  discussed 
in  Section  2.1;  this  of  course  can  be  generated  from  a  variety  of  effective  proof  calculi. 

We  shall  denote  by  Th^-  the  category  of  theories  associated  to  the  entailment  system 
I+.  Let  H  :  (E,T)  — ►  (E',r)  be  a  theory  morphism  in  Th given  peTwe  have 
r'  ^=£*  H(<p)  by  definition  of  theory  morphisms.  Therefore,  for  any  M1  6  Mod( E',  F') 
we  have  M1  {=£•  H(<p)  which  is  equivalent  to  )  (=s  <p  and  consequently  ff*(M*)  € 

Mod(E.T).  This  shows  that  the  functor  13*  :  Mod(E')  — ►  Mod( S)  restricts  to  a 
functor: 


&  :  Mod(E'.T')  —+  Mod(Z.T). 

Globally,  this  means  that  we  can  extend  our  original  functor  Mod  :  Sign0*  — ►  Cat  to 
a  functor: 


Mod  :  Th ^sop  — ►  Cat. 

We  shall  sail  an  institution  I  liberal  [27,26]  if  for  any  theory  morphism  H  :  T  — ►  V 
in  Th^  the  functor  J3*  :  Mod(V)  — ►  Mod(T)  always  has  a  left  adjoint,  denoted  S*, 

i.e.,  there  is  a  natural  isomorphism  ModiT,'\{H*(M'\.M,'\  =  Mod(T ) (M.  H* (M1)  1 . 
First  order  logic  is  not  liberal,  but  (first  order)  equational  logic  and  Horn  logic  are. 
Liberality  of  an  institution  is  an  abstract  measure  of  a  logic’s  algebraic  character. 
Lawvere  showed  in  his  thesis  [45]  that  all  the  usual  free  constructions  of  algebra  are 
direct  consequences  of  the  general  fact  that  equational  logic  is  a  liberal  institution. 
For  example,  for  T  the  theory  of  commutative  monoids,  V  the  theory  of  commutative 
rings,  and  H  :T  — ►  V,  the  theory  morphism  that  interprets  the  monoid  operation  as 
ring  multiplication,  the  functor  H*,  left  adjoint  to  H*,  is  the  monoid- ring  construction 
that,  for  free  commutative  monoids,  specializes  to  the  polynomial  ring  construction. 

We  say  that  an  institution  I  admits  initial  models  if  for  any  theory  T  €  Th^  the 
category  Mod(T)  has  an  initial  object,  denoted  It-  The  general  definition  of  initial 
objects  is  as  follows. 

Definition  5  In  any  category  A  an  object  I  is  said  to  be  initial  if  for  any  object  X  in 
A  there  is  a  unique  morphism  /  — *■  X  in  A.  □ 
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We  shall  call  an  institution  I  exact  if  the  functor  Mod  :  Th — ►  Cat  preserves 
pullback  diagrams.  ~~™ 

The  institution  of  first  order  equational  logic  is  exact  and  admits  initial  models, 
which  are  the  relatively  free  algebras  on  an  empty  set  of  generators.  For  example,  for 
the  theory  of  rings,  the  initial  algebra  is  the  ring  of  integers.  In  Section  5  we  shall 
encounter  many  other  logics  whose  underlying  institutions  are  liberal,  admit  initial 
models  and  are  exact. 

2.4  Logics 

We  are  now  ready  to  give  axioms  that  cover  both  the  provability  and  the  model- 
theoretic  sides  of  a  logic.  The  solution  is  very  simple:  a  logic  has  two  components 
consisting  of  an  entailment  system  and  an  institution  that  share  the  same  signatures 
and  sentences.  In  addition,  the  logic  must  be  sound,  i.e.,  we  must  have 

.  r  b  <p  =>  v\=ip. 

•  • 

For  complete  logics,  such  as  first  order  logic,  this  implication  is  actually  an  equivalence. 
Definition  6  A  logic  is  a  5-tuple  £  =  [Sign,  sen,  Mod,  b,  [=)  such  that: 

1.  (Sign,  sen,  b)  is  an  entailment  system; 

2.  (Sign,  sen,  Mod.  f=)  is  an  institution,  and 

3.  the  following  soundness  condition  is  satisfied:  for  any  E  6  Sign,  T  C  sen(E)  and 
tp  €  sen(E), 

r  bs  <p  =>  T  (=s  <p- 
A  logic  is  complete  if,  in  addition, 


r  br  <p  r  [=e  <p. 

A  logic  is  compact  if  its  underlying  entailment  system  is  so.  Similarly,  a  logic  is 
liberal,  admits  initial  models  or  is  exact  if  its  underlying  institution  is  so.  Given  a  logic 
£,  its  underlying  entailment  system  will  be  denoted  ent(£);  similarly,  its  underlying 
institution  will  be  denoted  inst(£).  □ 

Remarks : 


1.  In  Section  2.3  I  presented  the  original  definition  of  an  institution  as  given  by 
Goguen  and  Burstall  [27,26].  More  recently,  Goguen  and  Burstall  [28]  have 
added  the  proof-theoretic  requirement  that  the  set  sen(E)  is  a  category,  whose 
morphisms  <p  — ►  rjj  are  understood  as  proofs.  By  postulating  additional  as¬ 
sumptions,  such  as  compactness  and  existence  of  conjunction  for  sentences,  one 
could  associate  to  proofs  in  this  sense  an  entailment  relation  r  b  <p.  However, 
it  does  not  seem  possible  to  treat  in  this  way  the  general  case  of  an  arbitrary 
institution. 

5Strictly  speaking,  for  an  incomplete  logic,  compactness  can  be  a  property  of  either  I-,  or  of  f= 
(i.e.,  the  relation  between  sets  of  sentences  and  sentences  induced  by  satisfaction).  Therefore,  we  could 
speak  of  Kcompactness  and  b*comPactn«8  for  a  logic.  In  practice,  however,  if  anything  is  going  to  be 
compact  at  all,  it  will  probably  be  K 
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2.  Notice  that  the  inclusion  hs  C  f=E  is  natural  in  E.  Therefore,  the  soundness 
axiom  can  be  equivalently  expressed  by  saying  that  the  functor  H  Sign  — ►  Set. 
is  a  subfunctor  of  the  functor  [=:  Sign  — ►  Set,  and  the  completeness  axiom  by 
saying  that  the  two  functors  are  identical. 

3.  A  logic  t  determines  two  categories  of  theories  that  have  the  same  objects, 
but  in  general  have  different  morphisms.  One,  Th.  comes  from  its  underlying 
entailment  system;  the  other,  Th^,  comes  from  its  underlying  institution.  The 
soundness  axiom  gives  us  a  subcategory  inclusion  Th  *-*■  Th (,,  and  completeness 
makes  the  categories  identical. 

2.5  Going  to  Extremes 

Both  entailment  systems  and  institutions  provide  one-sided  accounts  of  logic.  The 
general  notion  of  logic  given  in  Section  2.4  has  the  pleasing  flexibility  of  allowing  us 
to  regard  an  entailment  system  or  an  institution  as  a  full  logic  of  a  special  kind.  In 
this  way,  the  proof-theoretic  and  model-theoretic  opposites  are  reconciled.  In  the  end, 
from  this  abstract  perspective,  each  can  claim  to  have  in  some  measure  what  the  other 
contended  it  lacked. 

From  our  discussion  in  Section  2.3  we  immediately  obtain  the  following, 

Proposition  7  An  institution  1  determines  a  complete  logic  having  I+  as  its  under¬ 
lying  entailment  system  and  having  I  itself  as  its  underlying  institution.  By  abuse  of 
language,  this  logic  is  also  denoted  I+.  □ 

A  somewhat  surprising  fact  is  that,  thanks  to  the  generality  of  the  axioms  for  a 
logic,  we  can  pull  a  logic  out  of  the  proof-theoretic  thin  air  of  an  entailment  system. 
Fiadeiro  and  Semades  [17]  associate  an  institution  to  a  ^-institution  in  a  somewhat 
similar  way.  The  notion  of  a  slice  category  is  used  in  the  proof  of  Proposition  9  and  will 
appear  several  other  times  in  this  paper.  Slice  categories  are  an  instance  of  Lawvere’s 
[45]  “comma  category”  construction  (see  [47]). 

Definition  8  For  B  an  object  in  a  category  C,  the  slice  category  B/C  has  as  objects 
morphisms  /  :  B  — ►  A  and  as  morphisms  from,  say,  /  :  B  — ►  A  to  g  :  B  — ►  C  those 
h  :  A  — *  C  in  C  such  that  ho  f  =  g;  morphism  composition  is  exactly  as  in  C.  Dually, 
the  slice  category  C/B  has  as  objects  morphisms  /  :  A  — ►  B  and  as  morphisms  from, 
say,  /  :  A  — *  B  to  g  :  C  — *  B  those  h  :  A  — *  C  in  C  such  that  g  o  h  =  /;  again, 
morphism  composition  is  as  in  C.  □ 

Proposition  9  We  can  associate  to  any  entailment  system  £  a  logic  V  that  has  i 
as  its  underlying  entailment  system.  Besides,  £*  is  complete,  exact,  and  admits  initial 
objects.  If  Th.  has  pushouts,  £ t  is  also  liberal. 

Proof:  For  E  a  signature  we  define  Afod(E)  as  the  slice  category  (S.0)/Th.  A 
signature  morphism  H  :  E  — ►  E'  induces  a  functor  Hk  :  Mod(E')  — ►  Afod(E)  which 
is  just  composition  with  H,  i.e.,  H^C)  =  G'  o  H.  Given  ( G  :  (E,0)  — *  (E',r'))  € 
ModCL)  satisfaction  is  defined  by  G  (=e  tp  iff  T'  Hjy  G(<p).  Therefore,  G  satisfies  a  set  of 
E-sentences  T  iff  there  is  a  theory  morphism  G  :  (E,T)  — *•  (E',T').  Thus,  for  T  6  Th, 
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Mod(T)  can  be  identified  with,  the  slice  category  T/Th.  The  logic  is  complete,  since 
the  identity  theory  morphism  1(e,t)  €  ModCE.T)  is  such  that  l(sj)  \=z  <p  iff  T  hs  ip. 
Any  slice  category  B/C  has  1b  as  its  initial  object  and  therefore  £*  admits  initial 
models,  with  It  =  It-  When  Th  has  pushouts,  exactness  and  liberality  follow  easily 
from  the  elementary  properties  of  a  pushout;  in  the  latter  case,  for  G  6  Mod(T)  and 
H  :  T  — ►  V  a  theory  morphism,  we  define  H*  (G)  as  the  pushout  of  G  along  E.  □ 


3  Proof  Calculi  and  Logical  Systems 

A  reasonable  objection  to  the  above  definition  of  logic  is  that  it  abstracts  away  the 
structure  of  proofs,  since  we  know  only  that  a  set  T  of  sentences  entails  another  sentence 
p,  but  no  information  is  given  about  the  internal  structure  of  such  a  T  b  p  entailment. 
This  observation,  while  entirely  correct,  may  be  a  virtue  rather  than  a  defect,  because 
the  entailment  relation  h  is  precisely  what  remains  invariant  under  the  many  equivalent 
proof  calculi  that  can  be  used  for  a  logic.  For  example,  in  first  order  logic  we  have 
many  different  proof  calculi:  Hilbert  styled,  sequent,  natural  deduction,  etc.,  each 
leading  to  a  different  notion  of  proof.  However,  the  logic  always  remains  the  same, 
first  order  logic,  precisely  because  all  proof  calculi  yield  the  same  entailment  relation 
K  Therefore,  rather  than  building  a  particular  proof  calculus  into  the  definition  of 
a  logic,  it  seems  more  satisfactory  to  axiomatize  separately  a  proof  calculus  P  for  a 
logic  £,,  so  that  many  different  such  calculi  can  be  used  in  connection  with  the  same 
logic.  This  point  is  directly  relevant  to  computer  science,  because  it  shows  that  we 
can  change  the  operational  semantics  (i.e.,  the  proof  calculus)  of  a  logic  programming 
language  without  altering  its  mathematical  semantics,  provided  that  the  old  and  the 
new  operational  semantics  have  the  same  entailments.  If  we  want  to  choose  a  specific 
proof  calculus  for  a  logic,  we  call  the  resulting  logic  plus  proof  calculus  a  logical  system. 
In  usual  practice,  and  specially  in  logic  programming  applications,  we  often  find  proof 
calculi  where  certain  restrictions  are  placed  on  the  signatures,  the  sentences  used 
as  axioms,  and  on  those  used  as  conclusions.  This  leads  to  the  notion  of  a  proof 
subcalctilus.  In  addition,  we  must  introduce  the  notion  of  an  effective  proof  subcalculus. 

The  basic  idea  of  a  proof  calculus  is  that  we  can  associate  to  each  theory  T  a 
proof- theoretic  structure  P[T )  consisting  of  all  proofs  that  use  the  sentences  of  T  as 
axioms.  The  structure  of  P{T)  will  typically  relate  such  proofs  in  some  algebraic 
manner.  For  example,  P[T)  may  be  a  multicategorrf .  However,  the  general  axioms  of 
a  proof  calculus  to  be  given  below  will  not  impose  any  particular  structure;  they  will 
postulate  that  P{T)  has  some  structure,  by  declaring  it  an  object  of  some  category  of 
structures. 

Definition  10  A  multicategory  consists  of  a  set  0  of  basic  objects  together  with  a 
category  C  whose  objects  are  finite  strings  T  =  A\, ...,  A„  of  elements  of  0,  and  such 
that,  denoting  by  T,  A  the  concatenation  of  two  strings  and  denoting  by  0  the  empty 
string,  the  morphisms  of  C  have  a  monoid  structure,  with  the  multiplication  of  two 
morphisms  a  :  Ti  — »  Ax  and  0  :  r2  — »  A2  denoted  a,0  and  being  of  the  form 
a,0  :  rx,r2  — *  Ai,A2.  In  addition,  the  multiplication  a,0  is  actually  a  functor 

eI  take  ample  liberties  with  this  notion,  due  to  Lambek  [44],  and  give  a  definition  that  is  not 
equivalent  to  Lambek’s  but  allows  viewing  multicategories  in  Lambek’s  sense  as  a  particular  case. 
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— ►  C,  i.e.,  it  preserves  identities  and  composition.  For  readers  familiar  with 
monoidal  categories,  we  can  rephrase  the  definition  by  saying  that  a  multicategory  is 
a  strict  monoidal  category  [47]  whose  monoid  of  objects  is  free.  The  general  notion  of 
homomorphism  between  multicategories  is  that  of  a  functor  that  preserves  the  monoid 
operation  _  on  the  noee,  i.e.,  a  strictly  monoidal  functor.  However,  I  shall  impose  the 
additional  restriction  that  the  functor  maps  basic  objects  to  basic  objects.  I  denote  by 
MultCat  the  category  with  objects  multicategories  and  with  morphisms  the  strictly 
monoidal  functors  that  satisfy  this  additional  restriction.  □ 

Example  11  (Natural  Deduction)  Given  a  theory  T  =  (E,  A)  in,  say,  first  order  logic, 
we  can  associate  to  it  a  multicategory  P(T)  with  sen(E)  as  its  set  of  basic  objects7  and 
with  morphisms8  a  :  Ax,...,  An  — ►  B\,...,Bm  consisting  of  sequences  a  = 
with  on  a  natural  deduction  proof  tree  of  B,  whose  leaves  only  involve  formulas  among 
those  in  A  and  in  A\ , ...,  An.  The  identity  id au...ju  i*  the  sequence  Ax, ...,  An  viewed  as 
a  sequence  of  proof  trees;  the  multiplication  a,0  is  the  concatenation  of  the  two  strings 
of  proof  trees.  Composition  of  a  :  Ax,...,  An  — *■  B\,...,Bm  with  0  :  B\,...,Bm  — ► 
Cx,...,  (7*  is  a  sequence  7  =  71,  ...,7*  with  7 <  the  proof  tree  obtained  from  the  tree  ft 
by  glueing  the  tree  a}  at  each  leaf  occurrence  of  □ 

We  would  like  to  view  P  as  a  functor  Th  — ►  Mult  Cat.  However,  some  caution 
is  required.  The  problem  is  that,  given  a  theory  morphism  H  :  (E ,T)  — ►  (E',r*)  a 
sentence  <p  €  T  is  mapped  by  £T  to  an  element  of  I’'*,  but  H(<p)  does  not  necessarily 
belong  to  T'.  In  seeking  a  natural  translation  of  proof  trees  to  define  a  morphism 
P[H)  :  P(E,T)  — ►  P(E',r)  in  MultCat.  we  run  into  a  problem  of  indeterminacy. 
This  problem  appears  when  we  try  to  map  the  proof  0  — ►  <p,  of  an  axiom  <p  6  I\ 
which  consists  of  the  one  node  tree  <p,  to  a  proof  0  — ►  H(<p),  of  H(<p),  since  when 
H(<p)  is  not  in  V  many  different  proofs  may  be  possible.  This  difficulty  has  an  easy 
solution  by  restricting  our  attention  to  the  subcategory  Thp  '-*■  £h  of  axiom-preserving 
theory  morphisms.  In  this  way,  we  get  a  functor  Thp  — ►  MultCat. 

We  can  forget  about  the  compositionality  of  proofs,  and  extract  from  P{T)  the  set 
of  all  proo&  of  theorems  of  T,  proofs(T)  =  {a  :  0  — ►  <p  in  P(X)  |  <p  €  sen(T)}.  We 
can  obtain  proofs(T)  as  the  set  Pr(P(T)),  where  Pr  is  a  functor  Pr  :  MultCat  — ►  Set 
sending  each  multicategory  (0,  C)  to  the  set  Pr(0,  C)  =  {a  :  0  — ►  Ain  C  \  A  €  0}. 
The  way  in  which  the  proof  calculus  and  the  entailment  system  are  linked  is  quite 
natural;  it  is  given  by  the  function  7 rj*  :  proofs(T)  — ►  sen(T )  mapping  each  proof 
a  :  0  — *  <p  to  its  corresponding  theorem  <p.  Therefore,  the  inverse  image  tt 
yields  the  set  of  all  proofs  of  if  <p  is  not  a  theorem  this  is  the  empty  set.  It  is  then 
easy  to  check  that  ?rr  is  a  natural  transformation  ir :  Pr  o  P  =>■  sen. 

Definition  12  A  proof  calculus  is  a  6-tuple  P  =  (Sign,  sen ,  h,  P,  Pr,  tt)  with: 

1.  (Sign,  sen,  H)  an  entailment  system; 

7  Actually,  we  want  the  basic  objects  to  be  formulas  rather  than  sentences;  however,  this  is  a  minor 
point,  since  we  may  assume  that  sen(E)  has  been  defined  as  consisting  of  formulas,  and  the  notion  of 
satisfaction  extends  easily  to  formulas. 

‘Notice  that  the  sequences  of  sentences  have  conjunctive  meaning  in  both  the  domain  and  the 
codomain  of  a  morphism  a. 
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2.  P  :  Thp  — ►  Struct  p  a  functor;  for  each  theory  T,  the  object  P{T)  S  Struct p  is 
called  its  proof-theoretic  structure ; 

3.  Pr  :  Struct p  — ►  Set  a  functor;  for  each  theory  T,  the  set  Pr(P(T))  is  called  its 
set  of  proofs.  We  shall  denote  by  proofs  the  composite  functor  Pro?  :  Thy  — ♦ 
Set : 

4.  ir :  proofs  =>  sen  a  natural  transformation,  such  that  for  each  theory  T  =  (£,  T), 
the  image  of  ttt  :  proofs(T)  — *•  sen(T)  is  the  set  I”.  The  map  Try  is  called  the 
projection  from  proofs  to  theorems  for  the  theory  T. 

A  proof  calculus  P  is  called  compact  iff  its  underlying  entailment  system,  denoted 
ent(P),  is  compact.  □ 

We  are  now  ready  to  axiomatize  the  notion  of  a  logical  system,  consisting  of  a  logic 
together  with  a  choice  of  a  proof  calculus  for  it. 

Definition  13  A  logical  system  is  an  8-tuple  S  =  {Sign,  sen.  Mod,  b,  [=,  P,  Pr,  ir)  such 
that: 

1.  {Sign,  sen.  Mod .  b.  |=)  is  a  logic,  and 

2.  {Sign,  sen,  b,  P,  Pr,  it)  is  a  proof  calculus. 

A  logical  system  is  called  complete,  compact,  liberal,  exact,  or  is  said  to  admit  initial 
models  iff  the  corresponding  properties  hold  for  its  underlying  logic.  Given  a  logical 
system  S,  its  underlying  logic  will  be  denoted  log{S)‘,  similarly,  its  underlying  proof 
calculus  will  be  denoted  pcalc{$).  □ 

3.1  Proof  Subcalculi 

It  is  quite  common  to  encounter  proof  systems  of  a  specialized  nature.  In  these  calculi, 
only  certain  signatures  are  admissible  as  syntax,  e.g.,  finite  signatures,  only  certain 
sentences  are  allowed  as  axioms,  and  only  certain  sentences  (possibly  different  from  the 
axioms)  are  allowed  as  conclusions.  The  obvious  reason  for  introducing  such  calculi 
is  that  proofs  are  simpler  under  the  given  restrictions.  This  may  serve  technical  or 
esthetical  and  expository  purposes  in  logic;  in  computer  science,  however,  the  choice 
between  an  efficient  and  an  inefficient  calculus  may  have  dramatic  practical  conse¬ 
quences.  For  logic  programming  languages,  such  calculi  do  (or  should)  coincide  with 
what  is  called  their  operational  semantics,  and  mark  the  difference  between  a  hopeless 
theorem  prover  and  a  very  efficient  programming  language.  Indeed,  one  of  the  main 
tasks  of  logic  programming  is  finding  efficient  proof  calculi  by  imposing  judicious  re¬ 
strictions  on  the  choice  of  axioms  and  conclusions.  For  example,  the  language  Prolog 
emerged  from  the  realization  that  resolution  could  be  made  much  more  efficient  when 
the  axioms  are  restricted  to  Horn  clauses,  and  equational  programming  languages  such 
as  OBJ  [19,29]  exploit  the  fact  that  term  rewriting,  which  is  complete  for  equations 
that  are  Church- Rosser  and  terminating,  is  enormously  more  efficient  than  unrestricted 
equational  deduction. 

Definition  14  A  proof  subcalculus  is  a  9-tuple  <2  =  {Sign,  sen,  b,  Sign0,  ax,  concl,  P , 
Pr,i r),  with: 
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1.  (Sign,  sen ,  h)  an  entailment  system. 

2.  Signo  a  subcategory  of  Sign  called  the  subcategory  of  admissible  signatures.  We 
denote  by  seno  the  restriction  of  the  functor  sen  to  the  subcategory  Sign 0. 

3.  ax  :  Signo  — ►  Set  a  subfunctor  of  the  functor  obtained  by  composing  seno  with 
the  powerset  functor,  i.e.,  there  is  a  natural  inclusion  ox(E)  C  P(sen(E))  for  each 
E  €  Signo.  Each  T  €  ox(E)  is  called  a  set  of  admissible  axioms  specified  by  Q. 
This  defines  a  subcategory  Th^  of  Thp  whose  objects  are  theories  T  =  (E,T) 
with  E  €  Signp  and  T  €  02(E),  and  whose  morphisms  are  axiom-preserving 
theory  morphisms  H  such  that  H  is  in  Signp. 

4.  eonel  :  Signp  — ►  Set  a  subfunctor  of  the  seno  functor.  The  sentences  <p  € 
eonel( E)  are  called  the  admissible  conclusions  specified  by  Q. 

5.  P  :  Thu  — ►  Struct q  a  functor;  for  each  T  €  Th^,  the  object  P(T)  6  Struct q 
is  called  its  proof-theoretic  structure. 

6.  Pr  :  Struct q  — ►  Set  a  functor;  for  each  T  €  ThM,  the  set  Pr(P(T))  is  called 
its  underlying  set  of  proofs  of  admissible  conclusions.  We  denote  by  proofs  the 
composite  functor  ProP. 

7.  7r  :  proofs  =>  seno  a  natural  transformation,  such  that  for  each  T  —  (E,F)  € 
Tkax  the  image  of  *t  :  proof s(T)  — *•  sen(T)  is  the  set  T*  n  concl  ( E).  The  map 
irT  is  called  the  projection  from  proofs  to  admissible  theorems  for  the  theory  T. 

Given  a  proof  subcalculus  Q,  ent(Q)  will  denote  its  underlying  entailment  system. 


Notice  that  when  no  restrictions  at  all  are  placed  on  signatures,  axioms  and  con¬ 
clusions,  i.e.,  when  Signp  =  Sign,  as(E)  =  -P(aen(E))  and  concl{L)  =  sen(E),  a  proof 
subcalculus  is  the  same  thing  as  a  proof  calculus. 

We  can,  finally,  axiomatize  the  notion  of  a  logical  subsystem,  consisting  of  a  logic 
together  with  a  choice  of  a  proof  subcalculus  for  it. 

Definition  15  A  logical  subsystem  is  an  11-tuple  S  —  (Sign,  sen.  Mod,  h,  (=,  Signp,  ax, 
concl,  P,  Pr,  ir)  such  that: 

1.  (Sign,  sen.  Mod,  H,  |=)  is  a  logic,  and 

2.  (Sign,  sen,  (-,  Signp,  ax,  concl,  P,  Pr,  ir)  is  a  proof  subcalculus. 

Given  a  logical  subsystem  S,  its  underlying  logic  will  be  denoted  log(S)\  similarly,  its 
underlying  proof  subcalculus  will  be  denoted  pscalc(S).  □ 

3.2  Effective  Proof  Subcalculi 

This  section  gives  additional  axioms  for  proof  calculi  that  are  effective  in  the  intuitive 
sense  of  being  mechanizable  by  an  (idealized)  computer.  The  axioms  are  not  as  expres¬ 
sive  or  as  general  as  possible9;  however,  they  will  suffice  for  many  purposes,  including 
our  axiomatization  of  logic  programming  in  Section  6. 

9  For  example,  they  only  consider  theories  with  a  finite  set  of  axioms,  and  proof-theoretic  structures 
are  involved  only  indirectly,  through  their  underlying  set  of  proofs. 
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The  challenge  in  a  topic  like  this  is  to  avoid  boring  and  annoying  the  reader  (and 
the  writer!)  with  horrible  encodings  of  everything  into  the  natural  numbers.  To  this 
purpose,  I  will  follow  the  axiomatic  approach  to  computability  outlined  by  Shoenfield 
in  [66].  The  basic  notions  axe  that  of  a  finite  object ,  a  space  of  finite  objects,  and 
an  algorithm.  In  Shoenfield’s  own  words,  a  finite  object  is  an  “object  which  can  be 
specified  by  a  finite  amount  of  information;”  computer  scientists  would  call  this  a  finite 
data  structure.  A  space  is  “an  infinite  class  X  of  finite  objects  such  that,  given  a  finite 
object  x,  we  can  decide  whether  or  not  x  belongs  to  X .”  Computer  scientists  would 
call  this  a  data  type.  Given  spaces  X  and  Y ,  a  recursive  function  f  :  X  — *•  Y  is  then  a 
(total!)  function  that  can  be  computed  by  an  algorithm ,  i.e.,  by  a  computer  program, 
when  we  disregard  space  and  time  limitations;  more  generally,  if  the  algorithm  may 
not  terminate  for  some  inputs,  we  call  the  corresponding  /  a  partial  recursive  function 
from  X  to  Y.  An  r.e.  subset  of  a  space  Y  is  a  subset  of  the  form  f[X)  for  some  partial 
recursive  function  /  :  X  — ►  Y.  Spaces  and  recursive  functions  form  a  category  Space , 
and  there  is  an  obvious  forgetful  functor  U  :  Space  — ►  Set  to  the  category  of  sets  and 
functions.  Notice  that  if  X  is  a  space,  then  the  set  Pfin(X)  of  finite  subsets  of  X  is  also 
a  space.  We  are  now  ready  to  axiomatize  effective  proof  calculi.  Since  proof  subcalculi 
generalize  proof  calculi,  only  effective  proof  subcalculi  are  defined.  The  reader  may 
keep  in  mind  the  case  of  first  order  deduction  for  theories  with  a  finite  signature  and 
a  finite  set  of  axioms  — possibly  with  additional  restrictions  on  the  axioms  and  on 
the  theorems  that  we  wish  to  prove —  as  a  standard  example.  Another  interesting 
example  is  the  effective  proof  subcalculus  of  equational  logic  provided  by  Church- 
Rosser  and  terminating  term  rewriting  systems,  in  which  the  admissible  signatures  are 
finite  signatures,  and  the  sets  of  admissible  axioms  are  finite  sets  of  Church-Rosser 
and  terminating  equations,  with  proofs  being  performed  by  term  rewriting.  Note  that 
the  Church-Rosser  property  is  a  property  of  an  entire  set  of  equations,  not  of  the 
individual  equations.  Many  other  examples  of  effective  proof  subcalculi  are  discussed 
in  Section  6. 

Definition  16  An  effective  proof  subcalculus  is  a  10-tuple  Q.  =  [Sign,  sen,  h,  Signp, 
seno,  ax,  concl,  P,  Pr,  jt),  such  that: 

1.  (Sign,  sen,  \~)  is  an  entailment  system. 

2.  Signp  is  a  subcategory  of  Sign  called  the  subcategory  of  admissible  signatures. 
We  denote  by  J  the  subcategory  inclusion  functor  Signp  c-+  Sign. 

3.  seno  :  Signp  — ►  Space  is  a  functor  such  that  ll  o  seno  =  sen  o  J .' 

4  .ax  :  Signp  — *  Space  is  a  subfunctor  of  the  functor  obtained  by  composing 
seno  :  Signp  — *  Space  with  the  functor  P/,n  :  Space  — ►  Space,  that  sends  each 
space  to  the  space  of  its  finite  subsets.  This  defines  a  subcategory  Thax  of  Thp 
whose  objects  are  theories  T  =  (E,T)  with  E  €  Signp  and  T  €  ax(E),  and  whose 
morphisms  are  axiom-preserving  theory  morphisms  H  such  that  H  is  in  Signp. 

5.  concl :  Signp  — *■  Space  is  a  subfunctor  of  the  functor  seno  :  Sign  — *  Space. 

6.  P  :  Thax  — ♦  Struct  q  is  a  functor. 

7.  Pr  :  Struct q  — ►  Space  is  a  functor.  We  denote  by  proofs  the  composite  functor 
ProP. 
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8.  k  :  proofs  =>  sene  is  a  natural  transformation. 

9.  Denoting  also  by  ax,  concl,  Pr ,  and  it  the  results  of  composing  with  U  each 
of  the  above  functors  and  the  natural  transformation  ir,  the  9-tuple  U(Q)  = 
[Sign,  sen,  h,  Sigrtp,  ax,  eonci,  P,  Pr,  t)  is  a  proof  subcalculus. 


Notice  that  if  Q,  is  an  effective  proof  subcalculus,  for  each  theory  T  €  Thaa  there 
is  an  associated  partial  recursive  function  search?  :  concl  (T)  x  IN  — ►  Pjin[proofs{T)) 
such  that  for  each  pair  (<p,  n),  consisting  of  an  admissible  conclusion  <p  and  a  natural 
number  n,  search?  [<p,  n)  is  undefined  if  has  cardinality  strictly  less  than  n,  and 

otherwise  yields  a  subset  of  n  elements  in  such  that,  when  tt?1^)  is  infinite, 

we  have  n?l[<p)  =  UneNat  scarch?(<p,  n).  An  algorithm  to  compute  search?  can  be 
obtained  as  follows:  since  all  spaces  are  isomorphic  [66],  there  is  a  listing  of  proof s(T), 
i.e.,  a  bijective  recursive  function  a  :  IN  — ►  proof s[fT).  We  define  search?[(p,n)  =  0; 
to  compute  search?[<p ,  n  + 1),  we  just  scan  through  the  listing  a  until  we  find  the  first 
n  +  1  proofs  ac(t'i), ...,  a(*w+i)  such  that  ir?(a(ij))  =  <p.  At  times,  it  is  possible  to  do 
better  than  this,  and  provide  an  algorithm  that  behaves  like  the  above  search?  when 
search?  is  defined,  but  such  that  for  some  of  the  inputs  (<p,n)  for  which  search?  is 
undefined,  it  yields  the  value  0,  understood  as  positive  failure  in  finite  time  to  find  a  set 
of  n  proofs.  We  shall  call  any  such  function  a  search  function  for  T  in  the  subcalculus 

fi. 

4  Relating  Logics 

In  this  section,  different  notions  of  map  are  introduced  and  motivated  with  examples  for 
the  different  logical  structures,  i.e.,  for  entailment  systems,  institutions,  logics,  proof 
(sub)calculi,  and  logical  (sub)systems.  Each  such  type  of  logical  structure  together 
with  its  corresponding  maps  determines  a  category,  and  those  categories  are  then 
related  by  forgetful  functors  and  by  adjoint  functors,  hi  all  cases  there  is  a  notion  of 
substructure,  such  as  a  subentailment  system,  a  sublogic,  etc.,  that  is  always  expressed 
as  a  map  satisfying  special  properties,  thus  giving  an  axiomatic  expression  to  the 
corresponding  intuitive  notion. 

4.1  Mapping  Entailment  Systems 

The  intuitive  idea  is  simple  enough:  we  want  to  map  the  syntax  and  the  sentences 
between  two  entailment  systems  in  a  way  that  is  consistent,  i.e.,  such  that  it  preserves 
ent  ailments. 

Example  17  Consider  the  relationship  between  the  entailment  systems  ent(Eqtl)  of 
equational  logic  and  ent(Fol)  of  first  order  logic  with  equality.  We  map  a  functional 
signature,  consisting  of  a  ranked  alphabet  F  of  function  symbols,  to  the  first  order 
signature  $(F)  =  (.F,  0)  with  same  function  symbols  and  with  no  predicate  symbols. 
Similarly,  we  can  define  a  map  a  sending  an  equation  t  =  t!  to  the  first  order  sentence 
Vxi...Vx„  t  =  t1  where  x1,...,xn  are  the  variables  occurring  in  either  t  or  f.  For  T  a  set 
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of  equations  and  a(r)  the  set  of  its  corresponding  first  order  sentences  we  of  course 
have 

r  \~p  t  =  t1  =>  «(r)  =  t1). 

Indeed,  we  actually  have  an  equivalence  rather  than  just  an  implication  between  the 
two  entailments,  so  that  this  particular  map  is  conservative.  □ 

Maps  of  entailment  systems  that  send  a  signature  to  another  signature  are  called 
plain.  However,  there  are  natural  examples  of  maps  that  are  not  plain. 

Example  18  Let  VFo/  denote  the  fragment  of  first  order  logic  without  equality  con¬ 
sisting  of  sentences  that  are  the  universal  quantification  of  a  quantifier  free  formula. 
Let  VFnFoli  be  the  fragment  of  two-sorted  first  order  logic  with  equality  having 
signatures  that  involve  only  function  symbols,  and  with  sentences  also  restricted  to 
universal  sentences.  We  shall  use  the  symbol  u  to  denote  one  of  the  sorts,  and  the 
symbol  bool  to  denote  the  other.  The  idea  of  viewing  every  predicate  as  a  characteristic 
function  yields  a  map  relating  the  entailment  systems  of  these  two  logics.  However,  an 
unsorted  signature  (F,  P)  should  not  be  mapped  to  another  signature,  but  to  a  theory 
$(F,P)  whose  signature  consists  only  of  the  following  function  symbols: 

1.  for  each  n-ary  function  symbol  /  €  F  there  is  a  function  symbol  /  :  un  — ♦  u; 

2.  for  each  n-ary  predicate  symbol  p  €  P  there  is  a  function  symbol  p°  :  u"  — ►  bool ; 

3.  there  are  constants  true,  false ,  a  unary  operation  not  and  binary  operations  and, 
or,  implies ,  all  with  sort  bool  for  their  arguments  and  their  result. 

The  axioms  of  the  theory  $(F,  P)  are  those  needed  to  force  the  interpretation  of  the 
sort  bool  to  be  a  two-element  boolean  algebra.  The  key  axioms  are  true  ^  false  and 
(V6  :  600/ )  (6  =  true )  V  (6  =  false).  In  addition,  we  must  give  equational  axioms  forcing 
not,  and,  or  and  implies  to  have  the  standard  meaning. 

We  can  now  define  a  translation  a  between  sentences  in  the  expected  way.  We 
translate  a  quantifier  free  formula  <p  into  a  term  <p°  of  sort  bool  as  follows.  An  atomic 
formula  p(ti,...,tn)  is  translated  into  the  term  p°(*i, ...,*»»)>  and  the  connectives  are 
translated  in  the  obvious  way,  e.g.,  <p  A  0  is  translated  into  and(<p°,ifr°),  etc.  We  then 
translate  a  sentence  Vzi...Vz„  <p  with  <p  quantifier  free  into  the  equational  sentence 
Vz1...Vzn  <p°  =  true.  As  in  the  previous  example  we  have 

r  <p  =>  a(r)  h*(s)  a(<p ) 

and,  indeed,  the  implication  is  also  in  this  case  am  equivalence.  The  map  from  signa¬ 
tures  to  theories  is  in  fact  a  functor10  $  :  Signypot  — ►  ThvpnFoi= 0,  and  the  translation 
a  is  a  natural  transformation  a  :  senvm  ==>  senvFnFoi=  0  □ 

Another  interesting  example  of  a  map  of  entailment  systems  that  is  not  plain  is 
furnished  by  the  translation  of  the  second  order  lambda  calculus  into  Martin-Lof  type 
theory  described  in  [52]. 

Notice  that  any  functor  $  :  Sign  — ►  Th'0  together  with  a  natural  transformation 
a  :  sen  =>■  sen'  o  $  can  easily  be  extended  to  a  functor  $  :  Thp  — *  Th'Q,  called 

10  Aa  before,  we  denote  by  Thp  the  sabcategory  of  Th  whose  theory  morphisms  map  axioms  to  axioms. 
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the  at- extension  to  theories  of  the  original  functor,  by  mapping  a  theory  T  =  (E,T) 
to  the  theory  4(T)  with  same  signature  as  that  of  4(E)  and  with  axioms  those  of 
4(E)  together  with  the  axioms  aj(r).  Notice  also  that  the  natural  transformation 
a; :  sen  =►  sen'  o  4  can  be  similarly  extended  to  a  natural  transformation  a  :  sen  => 
sen'  o  4  between  the  functors  sen  :  Thp  — ►  Set  and  sen'  o  4  :  Thp  — ►  Set:  we  just 
define  =  as.  Therefore,  we  may  as  well  view  4  as  a  functor  4  :  Thp  — ►  Th'Q 
mapping  theories  to  theories,  and  call  a  functor  4  :  Thp  — ►  Thp  ct-simplc  iff  it  is  in 
fact  the  o-extension  to  theories  of  a  functor  4  :  Sign  — *  Th^.  In  addition,  we  call 
4  at- plain  if  it  is  the  a^extension  to  theories  of  a  functor  4  that  factors  through  the 
functor  F  :  Sign'  — ►  Th$  sending  each  signature  E'  to  the  theory  (E',0),  i.e.,  if  it  is 
the  oe-extension  to  theories  of  a  functor  mapping  signatures  to  signatures. 

This  way  of  relating  entailment  systems  is  already  quite  general,  but  is  it  general 
enough?  The  answer  is  “no.”  There  are  natural  and  interesting  examples  of  maps 
between  entailment  systems  that  map  theories  in  a  more  subtle  way. 

Example  19  (Unfailing  Knuth-Bendix  Completion)  Consider  the  entailment  system 
of  equational  logic  ent(Eqtl).  By  using  an  unfailing  Knuth-Bendix  algorithm  [1],  we 
can  associate  to  an  equational  theory  T  =  (E,T)  a  (possibly  infinite)  Knuth-Bendix 
completion  KB{T)  =  (E,KB(r))  so  that  equational  deduction  in  T  can  be  treated  by 
term  rewriting  in  KB(T).  This  can  be  viewed  as  a  functor,  KB  :  Theqaa  — ►  Thsqtia 
which  is  not  a  simple  functor.  Since  T  and  KB(T)  are  isomorphic  theories  we  have 

T  1-y  t  =  i*  <=>  T  hnra(T)  t  =*  t! . 

After  defining  the  notion  of  a  mapping  of  entailment  systems  we  shall  be  able  to 
see  that  the  functor  KB,  together  with  the  identity  natural  transformation  from 
sen  to  itself,  give  us  an  entailment  system  map  ( KB,lttn )  :  ent(Eqtl)  — ►  cnt(Eqtl). 
□ 


An  even  simpler  example  is  provided  by  closure  under  entailment. 

Example  20  (Entailment  Closure)  Given  an  entailment  system  £  —  (Sign,  sen,  h), 
the  functor  (_)*  :  Thp  — ►  Thp  mapping  a  theory  T  =  (E,T)  to  the  theory  T*  =  (E,  T*), 
together  with  the  identity  natural  transformation  l^n  from  sen  to  itself,  will  give  us 
an  entailment  system  map  ((_)*,  1^)  :  £  — ►  £ .  □ 

Notice  that  the  functors  KB  and  (.)*  map  theories  having  the  same  signature  to 
theories  having  the  same  signature;  actually,  in  these  two  examples  signatures  are  left 
unchanged.  Let  us  denote  by  (E',  T')  the  image  obtained  by  applying  to  a  theory  (E,  T) 
a  functor  4  :  Thp  — ►  Th'0  that  maps  theories  with  the  same  signature  to  theories 
with  the  same  signature.  In  particular,  we  denote  by  (E',  0*)  the  theory  4(E,  0).  If  the 
functor  4  is  a-simple,  we  have  the  following  property: 

V  =  0'UaE(r). 

This  is  not  satisfied  by  KB  and  (.)*;  however,  they  satisfy  the  weaker  condition 

(ry  =  (0'uaE(r))\ 

These  functors,  although  more  general,  are  “sensible”  in  the  following  sense. 
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Definition  21  Given  entailment  systems  £  =  {Sign,  sen,  b)  and  £'  =  {Sign1,  sen', 
a  functor  $  :  Thp  — ►  Th p  and  a  natural  transformation  a  :  sen  =^>  sen’  o  $,  we 
call  $  a-sensible  iff  the  following  conditions  are  satisfied: 

1.  There  is  a  functor  :  Sign  — ►  Sign'  such  that  sign'  o$  =  fo  3ign. 

2.  (T')*  =  (0'  U  aE(r))*. 

□ 


a-sensible  functors  have  the  nice  property  that  their  natural  transformation  a  only 
depends  on  the  signatures,  not  on  the  theories.  This  is  a  consequence  of  the  following 
lemma. 

Lemma  22  Given  entailment  systems  £  =  {Sign,  sen,  b)  and  £'  =  {Sign1,  sen',  b'), 
and  a  functor  $  :  Thp  — ►  Th'0  satisfying  condition  (1)  in  Definition  21,  then  any 
natural  transformation  a  :  sen  ==>  sen '  o  $  can  be  obtained  by  horizontal  composition 
with  the  functor  sign  :  Thp  — ►  Sign  from  a  natural  transformation  a°  :  sen  => 
sen'  o 

Proof:  Again,  we  use  the  notation  $(E,T)  =  (S',T').  What  we  have  to  show  is 
that  for  any  T\  =  (E,  T),  Tt  =  (E,  A)  in  Thp,  we  have  ar,  =  ar,-  Then,  we  can 
define  a*.  =  ay,.  Notice  that,  for  any  T*  =  (E,A),  we  have  a  theory  morphism 
Is  :  (E,0)  — ►  (E,  A)  in  Thp.  Therefore,  it  is  enough  to  establish  this  property  when 
Ti  =  (E,0).  Since  $  satisfies  condition  (l),  we  must  have: 

$(1E  :  (E,0)  —  (E,  A))  =  {lz.  :  (E',01)  —  (E',  A')). 

Therefore,  we  have  sen(ls)  =  l«n(s),  and  sen'($(ls))  =  sen'{  1E.)  =  l„n'(s')-  The 
naturality  of  a  then  forces  ajt  =  cut,,  as  desired.  □ 

By  abuse  of  language,  in  the  following  we  shall  drop  the  “diamond”  and  write  aj 
instead  of  a|. 

Definition  23  Given  entailment  systems  £  =  (St£n,sen,b)  and  £'  =  {Sign1,  sen', 
b'),  a  map  of  entailment  systems  (4>,  a)  :  £  — ►  £'  consists  of  a  a  natural  transforma¬ 
tion  a  :  sen  =>  sen'  o  $  and  an  a-sensible  functor  $  :  Thp  — ►  Th'0  satisfying  the 
following  property: 


r  bE  ip  =>  aE(r)  b'#(E0)  ut{<p). 

We  call  ($,a)  conservative  if  in  addition  we  have, 

r  bE  V?  aE(r)  az{<p). 

We  call  ($,a)  plain  if  $  is  a-plain,  and,  similarly,  we  call  ($,a)  simple  if  $  is 
a-simple. 

A  subentailment  system  is  a  map  ($,a)  :  £  — ►  £'  of  entailment  systems  that  is 
plain,  conservative,  with  $  faithful  and  injective  on  objects,  and  with  a  injective.  We 
write  ($,a)  :  £  £'  to  denote  a  subentailment  system.  □ 
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Example  17  shows  that  the  entailment  system  of  equational  logic  is  a  subentailment 
system  of  that  of  first  order  logic.  Note  that  in  that  example  neither  $  nor  a  are  actual 
inclusions;  they  are  only  injections. 

The  following  lemma  follows  easily  from  the  basic  properties  of  sensible  functors 
ana  is  left  as  an  exercise. 

Lemma  24  For  ($,  a)  :  £  — ►  £'  a  map  of  entailment  systems,  the  following  property 
is  satisfied: 

r  t-(EtA)  <p  =>  aE(r)  h;(2  a)  axfo?). 

Furthermore,  if  ($,  a)  is  conservative,  the  above  implication  is  actually  an  equivalence. 

□ 

Given  maps  of  entailment  systems  ($i,ati)  :  £  — ►  £'  and  ($2,a3)  :  £'  — ►  £"  we 
can  define  their  composition  ($j,  a2)  o  ($l5  ai)  :  £  — ►  £"  as  the  pair  ($3,  as)  with 
$3  =  $2o$i  and  with  as  the  natural  transformation  obtained  by  pasting  together  the 
two  cells 


Using  Lemma  24,  it  is  easy  to  check  that  this  composition  is  itself  also  a  map  of 
entailment  systems  and  that,  therefore,  we  have  a  category  Ent  whose  objects  are 
entailment  systems  and  whose  morphisms  are  maps  between  them. 

The  following  lemma  is  left  as  an  exercise. 

Lemma  25  The  composition  of  conservative  maps  of  entailment  systems  is  conser¬ 
vative.  Furthermore,  if  for  ($i,ai)  :  £  — ►  £'  and  ($j,  a2)  :  £'  — ►  £"  maps  of 
entailment  systems  the  composition  ($2,a3)  o  (#lf  ai)  :  £  — ►  £"  is  conservative,  then 
($i,aj)  :  £  — *  £’  is  conservative.  □ 

4.2  Mapping  Institutions 

The  idea  of  a  map  of  institutions  is  somewhat  counterintuitive.  Although  the  syntax 
part  is  mapped  just  as  for  entailment  systems,  the  models  are  mapped  in  the  opposite 
direction.  This  can  be  best  illustrated  by  an  example. 

Example  26  Let  Eqtl  be  equational  logic,  and  let  MSEqtl  be  many-sorted  equational 
logic.  The  process  of  “omitting  sorts”  should  map  the  underlying  institution  of  MSEqtl 
to  the  underlying  institution  of  Eqtl.  A  many-sorted  theory  is  a  pair  (5,  F)  with  S  a 
set  of  sorts,  and  F  an  alphabet  of  function  symbols,  each  with  a  rank  consisting  of  a 
pair  (u;,s)  with  11/6S’  giving  the  sorts  of  the  arguments  and  s  the  sort  of  the  result. 
By  omitting  sorts,  we  map  such  a  signature  to  the  signature  F  such  that  /  has  rank  n 
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iff,  in  its  original  rank  (to,  s),  to  had  length  n.  This  defines  a  functor  $  :  Sign^^  — ► 
SignFijtt  from  many-sorted  to  unsorted  signatures.  In  many-sorted  equational  logic, 
the  number  and  sort  of  the  variables  being  quantified  in  an  equation  must  be  made 
explicit  [31],  but  this  is  not  required  for  unsorti  1  equational  logic.  Omitting  sorts  and 
explicit  quantification  transforms  a  many-sorted  equation  into  an  unsorted  equation, 
and  we  can  view  this  as  a  natural  transformation  a  :  sen^sE#  =>  sengqt jo$.  However, 
for  (S,  F)  a  many-sorted  signature,  there  is  no  natural  way  of  associating  an  unsorted 
F-algebra  to  a  many-sorted  (S,  Fj-algebra.  What  is  natural  is  to  associate  to  the 
F-algebra  A  the  (S,  ^-algebra  0(A)  with  carrier  such  that  A,  =  A  for  all 

s  €  S,  and  such  that  each  function  symbol  /  with  rank  (w,  s)  such  that  w  has  length  n 
is  interpreted  by  the  same  function  Af  :  An  — ►  A  that  interpreted  /  in  the  F -algebra 
A.  This  gives  us  a  functor  0(f,s)  '■  ModEqtt(^(F,  S))  — ►  ModMSEqa(F,  S ),  and  globally 
defines  a  natural  transformation  0  :  Mod gyi  o  $  =>  ModMSEqti.  By  definition  of  0(A) , 
for  any  many-sorted  signature  (5,  F)  and  any  (S,  F)-equation  <p  we  have, 

A  (=i?  a(<p)  iff  0(A)  t=(s^)  ¥>■ 

Notice,  finally,  that  instead  of  restricting  ourselves  to  equational  logic  we  could  just 
as  well  have  considered  unsorted  first  order  logic  and  many-sorted  first  order  logic.  □ 

Definition  27  Given  institutions  I  =  (Sign,  sen.  Mod .  [=)  and  I'  =  (Sign1,  sen', 
Mod!.  [='),  a  map  of  institutions  ($,  a,0)  :  I  — ►  I'  consists  of  a  natural  transforma¬ 
tions  a  :  sen  =>  sen'  o  $,  an  a-sensible  functor11  $  :  Thp  — ►  Th'0,  and  a  natural 
transformation  0  :  Mod !  o  $op  ==>  Mod  such  that  for  each  E  6  Sign,  <p  €  sen(E),  and 
M'  6  Mod'($( E.0))  the  following  property  is  satisfied: 

M'  [=«jn'(*(E,0))  «e(^)  iff  /?(2 ,t)(M')  (=e  <p. 

We  call  ($,a,/3)  plain  iff  $  is  a-plain,  and  similarly  call  ($,a,/3)  simple  iff  $  is 
a-simple. 

A  subinstitution  is  a  map  ($,  a,0)  :  I  — ►  I'  of  institutions  that  is  plain,  with  $ 
faithful  and  injective  on  objects,  with  a  injective,  and  with  0  a  natural  isomorphism. 
We  write  ($,  a,  0)  :  I  <-*  I’  to  denote  a  subinstitution.  For  example,  equational  logic 
is  a  subinstitution  of  first  order  logic.  □ 

Given  maps  of  institutions  ($i,ai,/3i)  :  I  — ►  I'  and  ($2><*2,/?2)  :  I'  — ♦  I"  we 
can  define  their  composition  ($2,ot2,02)  o  ($i,ai,/?i)  :  I  — r*  J"  as  the  pair  ($3,a3,/?3) 
with  $3  =  <&2°$i  and  with  a3  the  natural  transformation  obtained  by  pasting  together 
the  two  cells 


11  The  functor  $  is  at-sensible  for  the  entailment  systems  (Sign,  sen,  f=)  and  (Sign1,  sen',  p')  associ¬ 
ated  to  I  and  1'.  Since  the  categories  Th^_0  and  T7i{_0  do  not  depend  at  all  on  the  entailment  relations 

f=  and  1=',  we  write  Thp  and  Th'0  instead. 
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and  0s  the  natural  transformation  obtained  by  pasting  together  the  two  cells 


It  is  easy  to  check  that  this  composition  is  itself  a  map  of  institutions.  Therefore, 
we  have  a  category  Inst  whose  objects  are  institutions,  and  whose  morph  isms  are  maps 
between  them.  Recall  now  that,  by  Proposition  4,  every  institution  I  has  an  associated 
entailment  system  I+.  This  is  just  the  object  part  of  a  functor  (_)+  :  Inst  — ►  Ent, 
as  shown  by  the  following  lemma. 

Lemma  28  If  ($,a,0)  :  1  — ►  I'  is  a  map  of  institutions,  then  ($,a)  :  I+  — ►  I'+ 
is  a  map  of  entailment  systems. 

Proof:  We  have  to  show 


r  f=s  <p  =>  as{T)  K*(n.S)  aE(^) 

which,  rephrased  in  terms  of  the  closures  and  adopting  the  notation  $(£,  T)  =  (E\  T'), 
reads 


per*  =►  arb)  e  (O'uaECr))’, 

but  <p  €  T*  iff  for  each  M  €  ModiE.T)  we  have  M  |=s  <p.  Notice  that,  since  $  is 
a-sensible,  we  have  $(1E  :  (£,0)  — ►  (E,r))  =  1e»  :  (£',0')  — ►  (E',r)  and  therefore 
0*  C  T'  so  that  ModCL'.T')  C  ModCL'.  01).  Now  consider  the  functor 

fan *  MosLV,  n  —  M2d(S,r); 

by  naturality  of  /?  this  is  just  the  restriction  of  the  functor 

0(2,0)  :  ModUY!.  0M  — ♦  M2d(S,0). 

This  implies  that  for  each  M1  €  MotfffrfS.n).  0(e,0)(-M7)  ^=n  which  is  equivalent 
to  M'  [=2*  This  shows  02!^)  €  T'*;  and  since  $  is  a-sensible  this  is  equivalent 

to  a2(<p)  €  (0*  U  a2(r))*,  as  desired.  □ 

Remark.  Although  closely  related  and  inspired  by  them,  the  maps  of  institutions 
defined  above  are  different  from  what  Goguen  and  Burstail  call  institution  morphisms 
[27,28].  These  are  of  the  form  ($,  a,0)  :  I  — ►  I',  with  $  :  Sign  — ►  Sign1  a  functor, 
and  a  :  sen'  0  $  =►  sen,  0  :  Mod  =>  Mod  o  natural  transformations  satisfying 
a  condition  entirely  similar  to  the  one  above.  Besides  their  restriction  to  a  “plain” 
$,  the  main  difference  is  that  their  a  and  0  go  in  exactly  the  opposite  direction  than 
mine.  Since  the  $  still  goes  in  the  same  direction,  the  concepts  are  not  dual  and 
their  relationship  is  not  entirely  clear  unless  special  properties,  such  as  adjointness, 
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are  assumed  for  $.  Both  notions  of  mapping  will  probably  be  needed  to  account 
for  all  relevant  examples.  In  this  presentation,  I  have  favored  the  notion  of  a  map 
of  institutions  because  it  fits  well  many  natural  examples  and  permits  flexible  ways 
of  mapping  theories.  Also,  one  of  the  motivations  for  defining  maps  of  institutions 
is  to  introduce  the  concept  of  a  sub  institution  as  a  map  of  institutions  that  satisfies 
additional  properties;  this  does  not  seem  possible  using  institution  morphisms. 

4.3  Mapping  Logics 

Mapping  logics  is  now  easy.  We  just  map  their  underlying  entailment  and  institution 
components. 

Definition  29  Given  logics  £  =  (Sign,  sen,  Mod.  K  f=)  and  £'  =  (Sign1,  sen',  Mod!. 

|='),  a  map  of  logics  ($,  a,0)  :  £  — *  £'  consists  of  a  functor12  $  :  Thp  — *  i  h'0, 
and  natural  transformations  a  :  sen  ==>  sen1  o  $  and  j3  :  Mod '  o  $  =>  Mod  such  that: 

1.  ($,a,fl)  :  inst(£)  — ►  inst(£')  is  a  map  of  institutions,  and 

2.  ($,a)  :  ent(£)  — ►  ent(£')  is  a  map  of  entailment  systems. 

Therefore,  we  have  a  category  Log  whose  objects  are  logics  and  whose  morphisms  are 
maps  of  logics,  and  there  are  forgetful  functors  inst :  Log  — »  Inst  and  ent  :  Log  — * 
Ent  yielding  the  underlying  institution  and  the  underlying  entailment  system  of  a  logic 
respectively. 

We  call  ($,  a,0)  plain  iff  #  is  a-plain,  and,  similarly,  call  ($,a,/?)  simple  iff  $ 
is  a-simple.  We  call  ($,  a,0)  conservative  iff  ($,  a)  is  so  as  a  map  of  entailment 
systems13. 

A  sublogic  is  a  map  ($,«,/?)  :  £  — *  £’  of  logics  that  is  both  a  subinstitution  for 
the  underlying  institutions  and  a  subentailment  system  for  the  underlying  entailment 
systems.  We  write  ($,«,/?)  :  £  £'  to  denote  a  sublogic.  □ 

Example  30  All  the  previous  examples  of  maps  of  entaulment  systems  and  maps  of 
institutions  were  fragmentary  descriptions  of  maps  of  logics: 

1.  Example  17  comes  from  a  sublogic  Eqtl  '— *■  Fol~  that  views  equational  logic  as 
a  fragment  of  first  order  logic  with  equality. 

2.  Omitting  sorts  is  a  (plain)  map  of  logics  MSEqtl  — ►  Eqtl  (or  more  generally, 
MSFoF  — >FoF).  The  paper  [31}  shows  that  this  map  is  not  conservative,  and 
characterizes  the  largest  subcategory  of  SignMeF  for  which  omitting  sorts  is 
conservative. 

3.  Notice  that,  viewing  each  unsorted  equational  signature  as  a  many-sorted  sig¬ 
nature  with  one  sort  gives  us  a  sub  logic  Eqtl  t—*  MSEqtl,  that  when  composed 
with  the  above  “omitting  sorts”  map  MSEqtl  — ►  Eqtl  yields  the  identity  map 
on  Eqtl;  therefore,  omitting  sorts  is  a  retract  map. 

12Notice  that,  since  Thp  does  not  depend  on  the  entailment  relation,  we  have  Thp  =  Thj_0,  so  that, 

indeed,  the  domain  and  codomain  of  $  are  not  altered  by  viewing  $  as  a  map  of  entailment  systems 
or,  alternatively,  as  a  map  of  institutions. 

13Strictly  speaking,  for  a  noncomplete  logic  we  could  distinguish  between  (--conservative  and  (=- 
conservative  maps.  However,  conservativeness  of  the  underlying  map  of  entailment  systems  seems  the 
most  important  notion. 


27 


4.  The  encoding  of  predicates  as  characteristic  functions  is  a  conservative  map  of 
logics  VfW  — ►  VFnFol*  which  is  not  plain. 

□ 


The  construction  given  in  Section  2.5  of  the  logic  I+  associated  to  an  institution  I 
is  actually  a  functor  (_)+  :  Inst  — ►  Log.  This  follows  easily  from  Lemma  28.  Actually, 
we  have  two  natural  ways  of  associating  a  logic  to  an  institution,  one  associates  the 
most  complete  logic,  and  the  other  associates  the  least  complete  one. 

Proposition  31  The  forgetful  functor  *  nst :  Log  — ►  Inst  has  both  a  left  and  a  right 
adjoint. 

Proof:  The  right  adjoint  is  precisely  (_)+,  and  the  unit  map  of  the  adjunction  is  the 
sublogic  C  *-+  (inst(£))+.  The  left  adjoint  is  the  functor  (_)“  :  Inst  — ►  Log  sending 
an  institution  I  to  the  logic  having  I  as  its  underlying  institution  and  such  that  T  h  <p 
iff  <p  €  T.  The  counit  of  this  adjunction  is  the  sublogic  (inst(L))~  *-»£.□ 

The  construction  given  in  Section  2.5  of  the  logic  £*  associated  to  the  entailment 
system  t  does  not  seem  to  correspond  to  a  functor  (_)t  :  Ent  — ►  Log.  Given  a  map 
($,a)  :  £  — ►  £',  the  natural  choice  for  a  transformation  0  for  the  models  seems 
to  go  in  the  wrong  direction,  since  it  sends  a  model  H  :  T  — ►  V  in  £t  to  a  model 
9(H)  :  9(T)  — ►  9(V)  in  £**.  There  are  several  possible  choices  for  the  directions 
of  the  natural  transformations  a  and  0  corresponding  to  different  notions  of  ‘‘map” 
betwen  entailment  systems  and  between  institutions,  and  this  construction  happens 
to  be  functorial  for  a  different  choice  of  directions. 

4.4  Mapping  Proof  Calculi 

The  intuitive  idea  is  very  simple.  Given  two  proof  calculi,  a  mapping  between  them 
must  first  of  all  involve  a  map  ($,  a)  of  their  underlying  entailment  systems.  In 
addition,  we  want  to  map  a  proof  p  of  a  theorem  <p  of  a  theory  T  €  Thp  to  a  proof 
'y(p)  of  the  theorem  a(<p)  of  the  theory  $(!T). 

Example  32  Consider  the  subentailment  map  ($,a)  :  ent(Eqtl)  «-+■  ent(Fol=)  from 
the  entailment  system  of  equational  logic  to  the  entailment  system  of  first  order  logic 
with  equality  described  in  Example  17.  We  can  associate  to  the  equational  entailment 
system  the  proof  calculus  Psqti  with  P  :  Thp  — ►  Cat  the  functor  sending  each  equa¬ 
tional  theory  T  =  (E,T)  to  the  category  P(T)  whose  objects  are  E- terms,  and  whose 
morphisms  are  chains  of  elementary  steps  of  equational  deduction  using  equations  in 
T.  Morphism  composition  is  chain  concatenation.  The  functor  Pr  :  Cat  — ►  Set  sends 
each  small  category  C  to  the  set  of  triples  Pr(C )  =  {(A,f,B)  \  f  :  A  — ►  B  in  C}. 
Therefore,  the  functor  proofs  sends  T  to  the  set  of  triples  of  the  form  (t,  p,  0)  with  t,  t? 
E-terms  and  p  a  chain  of  elementary  steps  of  equational  deduction  using  the  equations 
in  T.  The  projection  function  n  forgets  about  the  proof  p  and  yields  the  equation  t  =  t1. 
Consider  now  a  proof  calculus  associated  to  ent(Fol~).  We  can  for  example  have  a 
natural  deduction  proof  calculus  extended  with  rules  for  equality,  and  give  to  its  proofs 
the  structure  of  a  multicategory.  A  map  of  proof  calculi  Psqti  — ►  Pfu*  compatible  with 
the  subentailment  ($,  a)  :  ent(Eqtl)  t— *  ent(Fol~)  now  consists  of  a  systematic  way  of 
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translating  a  chain  of  elementary  equational  deductions  pit —*  tf  using  axioms  in  T 
into  a  natural  deduction  proof  'i{t,p,tl)  of  the  theorem  a(t  =  t?)  =  (Vxx...Vxn  t  =  t1) 
for  the  theory  $(T),  i.e.,  in  a  family  of  functions, 

It  ■  proof sEqu(T)  — *  proofsFol=($(T)). 

Of  course,  “systematic”  means  that  the  maps  7r  should  be  independent  of  changes  in 
syntax,  i.e.,  that  they  should  form  a  natural  transformation  7  :  proof SEqti  ==>  proof sFol=. 
The  fact  that  proofs  of  one  theorem  go  to  proofs  of  its  translation  by  a  has  a  concise 
expression  in  terms  of  natural  transformations,  as  explained  below.  □ 

Definition  33  Given  proof  calculi  P  =  [Sign,  sen,  h,P,  Pr,7r)  and  P'  =  [Sign1,  sen1, 
H,P',Pr',  7T1),  a  map  of  proof  calculi  ($,0,7)  :  P  — ►  P'  consists  of  a  map  ($,a)  : 
ent(P)  — *  ent(P')  of  the  underlying  entailment  systems  together  with  a  natural 
transformation  7  :  proofs  =>  proofs  0  $  such  that  the  following  cells  are  identical: 


An  embedding 14  of  proof  calculi  is  a  map  of  proof  calculi  ($,0,7)  :  P  — ►  P'  such 
that  ($,  a)  :  ent(P)  — ►  ent(P')  is  a  subentailment  system  and  7  is  injective.  We  write 
($,0,7)  :  P  P'  to  denote  an  embedding.  □ 

Another  good  example  of  a  map  of  proof  calculi  is  the  translation  of  proofs  in 
the  sequent  calculus  into  natural  deduction  proofs.  Basically,  a  proof  in  the  sequent 
calculus  can  be  viewed  as  a  set  of  directions  for  constructing  a  natural  deduction  proof 
in  normal  form  (see  [59],  Appendix  A). 

Given  maps  of  proof  calculi  ($i,ai,7i)  :  P  — ►  P'  and  ($2,0:2,72)  :  P'  — *  P"  we 
can  define  their  composition  ($2,  02,72)0($i>  <*1,71)  :  P  — ►  P"  as  the  triple  ($3,0:3,73) 
with  ($3,03)  the  composition  of  the  underlying  maps  of  entailment  systems,  and  73 
the  natural  transformation  obtained  by  pasting  together  the  two  ceils, 


14Since  the  term  “proof  subcalculus”  has  already  been  used  with  a  specific  technical  meaning,  I  use 
the  term  “embedding”  instead. 
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Checking  that  ($3,03,73)  is  itself  a  map  of  proof  calculi,  and  that  composition  is 
associative,  is  an  easy  cell  pasting  exercise.  Therefore,  we  have  a  category  PCale 
having  proof  calculi  as  its  objects  and  having  maps  of  proof  calculi  as  its  morphisms. 
There  is  of  course  a  forgetful  functor  ent :  PCale  — ►  Ent  sending  each  proof  calculus 
to  its  underlying  entailment  system. 

Proposition  34  The  functor  ent :  PCale  — ►  Ent  has  a  right  adjoint  (.)*  :  Ent  — ► 
PCale . 

Proof:  Given  an  entailment  system  £  =  (Sign,  sen,  h),  the  proof  calculus  <?'  is  the 
6-tuple  £*  =  (Sign,  sen,  h,  thm,  lstt,j)  with  thm  the  functor  sending  each  theory  to 
the  set  of  its  theorems  described  in  Section  2.2,  and  j  the  natural  subfunctor  inclusion 
thm  C  sen.  The  unit  of  the  adjunction  for  a  proof  calculus  P  =  (Sign,  sen,  h,  P,  Pr,  ir) 
is  the  map  l-m*)  *•  P  — ►  (ent(^))*,  where  ir  is  now  viewed  as  a  natural 

transformation  ir :  proofs  ==>  thm.  □ 

Since  proof  subcalculi  generalize  proof  calculi,  the  above  definition  of  a  map  of 
proof  calculi  generalizes  to  the  following  definition. 

Definition  35  Given  proof  subcalculi  P  —  (Sign,  sen,  h,Signp,  ax,  concl,  P,  Pr,  7r) 
and  P'  —  (Sign1, 3en',  H,  Sign q,  ax',  concl',  P1,  Pr1,  ir1),  a  map  of  proof  subealeuli  ($,  a,  7) 
:  P  — ►  P'  consists  of  a  map  ($,o)  :  ent(P)  — ►  cnt(P')  of  the  underlying  en¬ 
tailment  systems  such  that  the  functor  $  :  Thp  — ►  Th^  restricts  to  a  functor15 
$  :  Thaa  — ►  Th'asi  together  with  a  natural  transformation  7  :  proofs  ==>  proof  J  o  $ 
such  that  the  following  cells  are  identical: 


An  embedding  of  proof  subcalculi  is  a  map  of  proof  subcalculi  ($,0,7)  :  P  — ►  P' 
such  that  ($,at)  :  ent(P)  — ►  ent(P')  is  a  subentailment  system  and  7  is  injective.  We 
write  ($,a,  7)  :  P  <-*  P'  to  denote  an  embedding.  □ 

Composition  of  maps  of  proof  subcalculi  is  defined  as  for  proof  calculi,  and  we 
get  a  category  PS  Cole  with  objects  proof  subcalculi  and  morphisms  maps  of  proof 
subcalculi.  We  then  have  a  forgetful  functor  ent :  PS  Cole  — ►  Ent  sending  each  proof 
subcalculus  to  its  underlying  entailment  system. 

By  replacing  everywhere  the  category  Set  by  the  category  Space  and  considering 
effective  proof  calculi  as  objects  we  can  define  in  an  entirely  similar  way  the  notion  of 
an  effective  map  of  proof  calculi.  This  gives  us  a  category  EffPSCale  and  a  forgetful 
functor  U  :  EffPSCale  — ♦  PSCale. 

15  Intuitively,  this  says  that  9  transforms  theories  using  only  Sign p  signatures  and  ax  axioms  into 
theories  that  only  use  Sign^  signatures  and  ax'  axioms. 
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4.5  Mapping  Logical  Systems 

We  can  now  gather  everything  together  and  obtain  notions  of  a  map  of  logical  systems 
and  a  map  of  logical  subsystems. 

Definition  36  Given  logical  sytems  S  =  (Sign,  sen. Mod.  b,  |=,  P,  Pr,  n)  and  S'  = 
( Sign ',  sen', Mod \  H.  (='.  P‘ .  Pr1.  ir1)  a  map  of  logical  systems  ($,<*,/?, 7)  :  $  — ►  S' 
consists  of  a  functor  $  and  natural  transformations  a,/?, 7  such  that: 

1.  ($,a,/3)  :  log(S)  — ►  log(S' )  is  a  map  of  the  underlying  logics,  and 

2.  ($,0,7)  :  pcalc(S)  — ►  pcalc(S')  is  a  map  of  the  underlying  proof  calculi. 

This  defines  a  category  LogSys  whose  objects  are  logical  systems  and  whose  morphisms 
are  maps  of  logical  systems,  and  we  have  forgetful  functors  log  :  LogSys  — ►  Log  and 
pealc  :  LogSys  — ►  PC  ale. 

An  embedding  of  logical  systems  is  a  map  ($,  a,/?, 7)  :  S  — *  S'  of  logical  systems 
such  that  ($,  a.,0)  is  a  sublogic  and  ($,0,7)  is  an  embedding  of  proof  calculi.  We 
write  ($, a,/?, 7)  :  S  *-+  S'  to  denote  an  embedding  of  logical  systems.  □ 

The  following  result  is  entirely  analogous  to  Proposition  34  and  is  left  as  an  exercise. 

Proposition  37  The  functor  ent  :  LogSys  — ►  Log  has  a  right  adjoint  (_)*  :  Log  — ► 
LogSys.  □ 

Maps  of  logical  subsystems  axe  defined  in  an  entirely  analogous  manner. 

Definition  38  Given  logical  subsystems  $  =  (Sign,  sen.  Mod,  b,  |=,  Sign0 ,  ax,  concl, 
P,Pr,n)  and  S'  =  (Sign1 ,  sen' ,  Mod! ,bf ,  \='  ,Sign'Q,ax! , concl'  ,P'  ,-k')  a  map  of  log¬ 
ical  subsystems  7)  :  S  — *■  S'  consists  of  a  functor  $  and  natural  transforma¬ 

tions  a,  (3, 7  such  that: 

1.  ($,a,/?)  :  log(S)  — ♦  log(S')  is  a  map  of  the  underlying  logics,  and 

2.  ($,<*,7)  :  pscalc(S)  — ►  pscalc(S')  is  a  map  of  the  underlying  proof  subcalculi. 

This  defines  a  category  LogSSys  whose  objects  are  logical  subsystems  and  whose  mor¬ 
phisms  axe  maps  of  logical  subsystems,  and  we  have  forgetful  functors  log  :  LogSSys  — ♦ 
Log  and  pscalc  :  LogSSys  — ►  PS  Calc. 

An  embedding  of  logical  subsystems  is  a  map  ($,  a,  /?,  7)  :  S  — ►  S'  of  logical  subsys¬ 
tems  such  that  ($,  a,0)  is  a  sublogic  and  ($,  0,7)  is  an  embedding  of  proof  subcalculi. 
We  write  ($,a,/?,7)  :  S  *-*■  S'  to  denote  an  embedding  of  logical  subsystems.  □ 

We  can  summarize  the  relationships  between  the  different  categories  of  entailment 
systems,  institutions,  logics,  proof  (sub)calculi  and  logical  (sub)systems  by  the  follow¬ 
ing  commutative  diagram,  where  for  simplicity  only  the  forgetful  functors  are  included. 
The  several  adjoints  already  described  as  well  as  the  category  of  effective  proof  subcal¬ 
culi  axe  omitted,  but  they  should  be  kept  in  mind  to  obtain  a  more  complete  summary. 
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LogSya  r  LogSSya 


5  Categorical  Logics 

Categorical  logics  give  us  great  model-theoretic  flexibility,  since  their  models  are  not 
restricted  to  the  traditional  set-theoretic  structures  with  functions,  predicates,  etc., 
that  are  assumed  as  basic  even  in  axiomatic  approaches  such  as  abstract  model  theory. 
In  fact,  the  question  “What  is  a  model?”  is  far  from  settled  for  the  many  higher 
order  logics  of  interest  to  logicians  and  computer  scientists.  Some  of  the  proposed 
models  have  a  somewhat  ad  hoc  character,  and  may  fail  to  reflect  adequately  the  basic 
intuitions.  Nevertheless,  models  are  essential  to  semantic  understanding.  Category 
theory,  and  in  particular  the  categorical  approach  to  logic  originating  in  the  wide  and 
seminal  work  of  F.W.  Lawvere,  has  much  to  offer  in  this  regard.  This  is  probably  quite 
widely  recognized  and  in  some  cases,  such  as  the  relationship  between  the  typed  A- 
calculus  and  cartesian  closed  categories,  or  between  intuitionistic  set  theory  and  topos 
theory,  well  understood.  However,  the  potential  of  the  categorical  point  of  view  has 
yet  to  be  fully  exploited. 

There  is  no  better  way  to  begin  our  discussion  of  categorical  logics  than  by  quoting 
some  words  from  a  fundamental  paper  published  by  F.W.  Lawvere  in  1969  [46].  The 
paper  begins  with  the  following  words: 

“That  pursuit  of  exact  knowledge  which  we  call  mathematics  seems 
to  involve  in  an  essential  way  two  dual  aspects,  which  we  may  call  the 
Formal  and  the  Conceptual.  For  example,  we  manipulate  algebraically  a 
polynomial  equation  and  visualize  geometrically  the  corresponding  curve. 

Or  we  concentrate  in  one  moment  on  the  deduction  of  theorems  from  the 
axioms  of  group  theory,  and  in  the  next  consider  the  classes  of  actual  groups 
to  which  the  theorems  refer.  Thus  the  Conceptual  is  in  a  certain  sense  the 
subject  matter  of  the  Formal.” 

and  ends  with  the  following  paragraph: 

“Finally,  in  Foundations  there  is  the  familiar  Galois  connection  between 
sets  of  axioms  and  classes  of  models,  for  a  fixed  set  of  relation  variables 
Ri.  Globalizing  to  an  adjoint  pair  allows  making  precise  the  semantical 
effect,  not  only  of  increasing  the  axioms,  but  also  of  omitting  some  relation 
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symbols  or  reinterpreting  them,  in  a  unified  way.  And  if  we  deal  with 
categories  of  models,  allows  the  latter  to  determine  their  own  full  sets  of 
natural  relation  variables,  thus  giving  definability  theory  a  new  significance 
outside  the  realm  of  axiomatic  classes.  To  do  this  for  a  given  species  — 
equational,  elementary,  higher-order,  etc. —  of,  say,  /-sorted  theories,  one 
defines  an  adjoint  situation 


semantics  ,  _  f  _  r,v 

Theories0*  ^  :  .  *  (Cat, [Sets1 ) 

struct  are 

in  which  the  right  hand  side  denotes  a  category  whose  morphisms  are 
commutative  triangles 

C - ►  C 

Sets1 

of  functors  with  C  and  C  more  or  less  arbitrary  categories.  The  invariant 
notion  of  theory  here  appropriate  has,  in  all  cases  considered  by  the  au¬ 
thor,  been  expressed  most  naturally  by  identifying  a  theory  T  itself  with  a 
category  of  a  certain  sort,  in  which  case  the  semantics  (category  of  Models) 
of  T  is  a  certain  subcategory  of  the  category  of  functors  T  — ►  Sets.  There 
is  then  a  further  adjoint  situation 

Formal  ,  r  Theories 

describing  the  presentation  of  the  invariant  theories  by  means  of  the  for¬ 
malized  languages  appropriate  to  the  species.  Composing  this  with  above, 
and  tentatively  identifying  the  Conceptual  with  categories  of  the  general 
sort  (Cot,  [Sets7]),  we  arrive  at  a  family  of  adjoint  situations 

Formal °*  ,  *  Conceptual 

(one  for  each  species  of  theory)  which  one  may  reasonably  hope  consitute 
the  fragments  of  a  precise  description  of  the  duality  with  which  we  began 
our  discussion.” 

The  use  of  the  category  Sets 1  is  more  an  illustration  for  the  case  of  classical  set- 
theoretic  models  than  a  necessary  requirement.  The  version  axiomatized  below  has  a 
category  J_  of  categories  with  structure  instead  of  Sets1.  The  relationship  with  the 
axiomatic  definition  of  logic  presented  in  Section  2  is  as  follows: 

•  The  category  that  Lawvere  calls  “ Formal ”  coincides  with  the  category  Th0  for 
a  logic  £,  in  our  sense. 


33 


•  A  theory  in  Lawvere’s  sense  is  a  category  C  with  a  certain  structure.  That 
structure  is  meant  to  capture  the  essential  aspects  of  a  logic  C,  so  that  the 
category  C  can  be  understood  as  an  abstract  “theory”  that  is  independent  of 
both  a  choice  of  syntax  for  £  and  a  particular  presentation  of  the  axioms.  In 
fact,  such  categories  can  be  viewed  both  as  abstract  “theories,”  and  as  “generic” 
models.  For  example,  a  typed  lambda  calculus  theory  (H,E)  generates  a  free 
cartesian  closed  category  /(E  ,E),  which  is  the  abstract  theory  in  Lawvere’s 
sense,  and  also  the  generic  “term”  model  of  the  theory  (E ,2?). 

•  To  avoid  confusion  between  concrete  and  abstract  theories,  I  identify  Lawvere’s 
category  Theories  with  a  category  X  whose  objects  are  categories  with  some  ad¬ 
ditional  structure,  and  whose  morphia  ms  are  functors  preserving  that  structure. 
For  example,  in  the  case  of  the  typed  A-calculus,  X  =  CCCat.  the  category  of 
cartesian  closed  categories,  with  morphia  ms  functors  that  strictly  preserve  the 
cartesian  closed  structure  [43]. 

Example  39  (General  Equational  Logic)  Equatiohal  logic  was  the  first  instance  of 
a  categorical  logic  considered  by  Lawvere  in  his  doctoral  dissertation  [45].  Lawvere 
restricted  his  analysis  to  classical  set-theoretic  models.  Given  an  equational  theory 
(E,E),  he  exhibited  a  category  with  finite  products  7(11,  E)  such  that  E-algebras 
A  that  satisfy  the  equations  E  can  be  put  into  1-1  correspondence  with  functors 
A  :  7(11,  E)  -*  Set  that  strictly  preserve  products;  i.e.,  chosen  products  in  7(11,  E) 
are  mapped  to  cartesian  products  in  Set. 

The  category  7(11,  E)  is  easy  to  describe.  Its  objects  are  the  natural  numbers. 
A  morphism  [t]  :  n  — ♦  1  is  the  equivalence  class  modulo  the  equations  E  of  a  E- 
terzn  t  whose  variables  are  among  xlf  A  morphism  n  — *  m  is  an  m-tuple 

of  morphiszns  n  — ♦  1.  Morphism  composition  is  term  substitution.  For  example, 
[xj  +  xi]  o  ([i7  *  xa],  [x4  +  xs])  =  [(x4  +  xs)  +  (xy  *  Xa)].  It  is  then  easy  to  see  that 
the  object  n  is  the  nth  product  of  the  object  1  with  projections  [xi],...,  [x„]_;  and,  more 
generally,  that  the  product  of  the  objects  n  and  misr»+m.  The  functor  A  associated 
to  the  algebra  A  sends  the  morphism  [tj  :  »  — ►  1  to  the  derived  operation  An  — ►  A 
associated  to  the  term  t.  Under  this  correspondence  between  algebras  and  functors, 
an  equation  t  =  tf  is  satisfied  by  a  (E,  E)-aIgebra  A  iff  A([tj)  =  A([t/]).  The  analogous 
case  of  many-sorted  equational  logic  was  studied  by  B6nabou  in  his  thesis  [4],  The 
category  7(11,  E)  is  constructed  as  in  the  unsorted  case,  but  now  it  has  as  its  set  of 
objects  the  free  monoid  S'  generated  by  the  set  S  of  sorts. 

In  the  many-sorted  case,  we  can  view  the  construction  of  7(H,E)  as  a  functor 
7  :  Thp  — ♦  x  Cat  where  the  theories  in  Thp  are  many-sorted  equational  theories, 
x  Cat  is  a  category  whose  objects  are  small16  categories  with  chosen  finite  products 
and  whose  morphisms  are  functors  that  strictly  preserve  the  chosen  finite  products. 
The  functor  7  is  the  left  adjoint  of  a  functor  U  :  x  Cat  — ►  Thp  that  associates  to  each 
category  with  (chosen)  finite  products  C  the  many-sorted  equational  theory  U(C)  whose 
set  of  sorts  is  the  set  |C|  of  objects  of  C,  whose  ranked  set  of  operations  has  as  constants 
of  sort  A  the  morphisms  a  :  1  — ►  A,  with  1  the  chosen  final  object,  as  unary  operations 
of  type  A  — ♦  B  the  morphisms  /  :  A  — *  B  in  C,  and  as  n-ary  operations  of  type 

16 1  will  not  worry  much  about  foundations;  later  examples  will  also  involve  ‘large”  categories.  All 
concerns  can  be  resolved  using  universes. 
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Ai ...  An  — ►  C,  for  n  >  1,  symbols  /a,...a,»  one  for  each  /  :  B  =  A\  x  . . .  x  An  — ►  C 
in  C  as  well  as  new  operations  Uai..~ah  :  A\...An  — *  B  for  each  B  =  A\  x  . . .  x  An. 
The  equations  of  U(C)  include  the  equations  (^a,  (x)  , . . . ,  ^a*  (x))  =  x,  for  : 

A\  x  ...  x  An  — ►  A\  the  chosen  Ith  projection  in  C,  the  equations  (xi, . . . ,  xn)  = 

/(t'Aj—A*  (xi, . . . ,  xn)),  and  all  other  equations  satisfied  when  interpreting  the  i/Ai.~a«  ’s 
as  identities  and  the  /’ s  and  /a,„.a.’s  by  their  corresponding  morphisms  in  C.  For 
example,  if  h  :  C  — ►  D  with  D  =  A  x  B  is  the  unique  morphism  in  C  induced  by 
morphisms  /  :  C  — ►  A,  and  g  :  C  — ►  B,  then  we  have  an  equation  uAt3(f(x),  g(x))  = 
h(x). 

The  great  conceptual  advantage  of  viewing  a  T- algebra  as  a  product  preserving 
functor  7  ( T )  — ►  Set  is  that  the  concept  generalizes  immediately  to  that  of  a  T-algebra 
in  any  category  with  finite  products.  Thus,  for  Tarp  the  theory  of  groups,  a  topological 
group  can  be  regarded  as  a  product  preserving  functor  7{Tgtp)  — ►  Top  landing  in 
the  category  Top  of  topological  spaces,  and  a  sheaf  of  groups  on  a  topological  space 
X  can  be  viewed  as  a  product  preserving  functor  7{TgtP)  — *  Sheaves (X)  landing 
in  the  category  Sheaves  (X)  of  sheaves  on  X.  Therefore,  we  can  in  general  define  a 
group  in  a  category  with  finite  products  C  as  a  product  preserving  functor  7{Tgtp)  — ► 
C.  Since  /(Tcrp)  is  also  a  category  with  finite  products,  we  can  consider  the  group 
1  T(Ta,r)  ■  7(TGrp)  — ♦  7(TGrp),  which  can  be  understood  as  the  generic  group.  By 
construction,  this  group  satisfies  the  equation  t  =  t'  iff  [t]  =  [f'],  iff  the  equation 
t  =  t'  is  a  theorem  of  group  theory.  By  using  product  preserving  functors,  we  can 
even  relate  groups  in  different  categories.  For  example,  we  can  relate  a  sheaf  of  goups 
§  :  J[Tgtp)  — ♦  Sheaves(X)  to  its  group  of  global  sections  r(£)  by  composing  with  the 
global  sections  functor  T(_)  :  Sheaves  (X)  — ►  Set,  which  can  therefore  be  understood 
as  a  homomorphism17  between  those  two  groups.  Thus,  we  can  structure  all  the 
possible  groups  in  all  possible  categories  and  the  product  preserving  functors  that 
relate  them  as  the  slice  category  7{Tgn)/ x  Cat.  □ 

Definition  40  A  logic  L  is  called  a  categorical  logic  on  J_  if  there  is  a  category  T_ 
with  pushouts  and  with  a  faithful  functor  £  — ►  Cat  such  that: 

1.  There  are  functors  U  :  J_  — ►  Thp  and  7  :  Thp  — ►  £  with  7  left  adjoint  to  U. 

2.  The  functor  Mod  :  Tho°v  — *■  Cat  is  naturally  isomorphic18  to  the  functor 

Th0 "  £op  ^  Cat . 

where  the  functor  ./£  sends  an  object19  C  6  £  to  the  slice  category  C/T_. 

3.  For  any  theory  T  =  (E,  T)  and  sentence  <£>  6  sen(E)  we  have: 

T  <p  l/(r)  <p. 


□ 

17  A  more  general  notion  of  homomorphism  that  specialises  to  the  usual  one  when  all  the  groups  are 
in  the  category  Set  is  discussed  in  Remark  (2)  at  the  end  of  this  section. 

18To  simplify  the  discussion,  in  what  follows  I  ignore  this  isomorphism  and  identify  Mod(T)  with 

m/L. 

l9We  think  of  C  as  a  category  with  a  certain  structure,  and  of  T_  as  the  category  of  all  categories 
with  that  type  of  structure. 
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Example  41  The  general  equational  logic  example  where  J_  —  x  Cat  has  already 
been  discussed  in  detail.  Here  are  a  few  other  additional  examples: 

1.  £  the  logic  of  the  typed  lambda  calculus,  where  X.  —  CCCat  is  the  category  of 
cartesian  closed  categories;  see  [43]  and  Section  6.2. 

2.  £  higher  order  intuitionistic  logic,  where  X.  =  Toposts  is  the  category  of  elemen¬ 
tary  Lawvere-Tierney  to  poses;  see  [6]. 

3.  £  the  logic  of  Martin- Lof  type  theory  with  equality  types,  where  X  =  LCCCat 
is  the  category  of  locally  cartesian  closed  categories;  see  [65]  and  Section  6.2. 

4.  £  the  logic  of  Martin- Lof  type  theory  without  equality  types,  where  X  is  ei¬ 
ther  Cartmell’s  category  of  contextual  categories,  or  the  category  RCCCat  of 
relatively  cartesian  closed  categories;  see  [10,41]. 

5.  £  the  logic  of  the  Girard-Reynolds  polymorphic  lambda  calculus  [21,60],  where 
X  =  PLCat  is  Seely’s  category  of  PL-categories;  see  [63]  and  Section *6.2. 

6.  £  the  logic  of  the  Girard-Reynolds  polymorphic  lambda  calculus  [21,60],  where 
X  —  RCCCat  is  the  category  of  relatively  cartesian  closed  categories;  see  [52] 
and  Section  6.2. 

7.  £  Girard’s  linear  logic  [22],  where  X  is  Seely’s  category  of  linear  categories;  see 
[64,56],  and  the  related  [14]. 

□ 


Categorical  logics  have  very  nice  model-theoretic  properties  indeed,  as  expressed 
in  the  following  theorem. 

Theorem  42  Any  categorical  logic  £  is  complete,  liberal,  exact,  and  admits  initial 
models. 

Proof:  Completeness  is  clear  from  condition  (3),  since  we  have 

r  f=r  <p  =>  ir(sj)  Ns  s=>  r  I"e  <p. 

For  any  C  €  X>  the  map  lc  :  C  -*  C  is  obviously  an  initial  object  in  the  slice 
category  C/X •  Therefore,  1?(t)  is  an  initial  object  in  T{T)/J_  and  as  a  consequence 
Mod(T)  has  an  initial  model. 

Liberality  is  a  direct  consequence  of  the  following  well  known  lemma. 

Lemma  43  Let  A  be  a  category  with  pushouts,  and  /  :  C  — *  C'  a  morphism  in 
A.  Then,  the  “composition  along  /”  functor  f/A  :  C" / A  — ►  CjA  mapping  each 
h  :  C'  -*  D  to  h  o  f  :  C  — *  D  has  a  left  adjoint  given  by  “pushout  along  /”  that  maps 
each  g  :  C  — ►  E  to  the  map  C'  -*  E'  in  the  pushout  diagram 
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□ 

For  exactness,  notice  that,  since  7  :  Thp  — ►  X  is  a  left  adjoint,  by  duality  7op  : 
Th oop  — *■  £op  is  a  right  adjoint.  Since  right  adjoints  preserve  limits  (see  [47],  Theorem 
V.5.1)  and  Mod  =  L/T)  o  7op,  we  only  have  to  show  that  _/£  preserves  finite  limits. 
This  follows  from  the  easy  lemma  below. 

Lemma  44  For  any  category  A,  the  functor  ./A  :  A^  — ►  Cat  preserves  pullbacks. 
□ 

□ 


In  the  passage  of  Lawvere’s  paper  [46]  cited  above,  Lawvere  mentions  a  structure- 
semantics  adjointness  result.  This  result  appeared  in  his  thesis  [45]  for  the  case  of 
algebras  on  the  category  of  sets  and  has  since  then  been  generalized  in  many  directions. 
However,  I  am  not  aware  of  other  formulations  with  the  degree  of  generality  of  Theorem 
45  below. 

Notice  that  the  functor  _/£ :  Top  — ►  Cat  factors  as: 

Top  m  Cat! IT  — »  Cat. 

where: 

•  Cat  //T  is  the  full  subcategory  of  the  slice  category  Cat/T  given  by  those  func¬ 
tors  A.  — *  £  such  that  A  has  an  initial  object  J*  that  we  assume  chosen  once 
and  for  all; 

•  the  functor  sem  sends  C  to  the  projection  functor  C/T  — *  £  :  (C  — ►  V)  >-*•  V , 
and 

•  the  functor  Cat  //  T  — ►  Cat  is  the  projection  functor  (A  — ►  £)  A- 

Theorem  45  For  £  a  categorical  logic  on  £,  the  functor  sem  :  £op  — ►  Catf/T  is 
full  and  faithful,  and  has  a  left  adjoint  str  :  Catl/T  — ►  £op. 

Proof:  The  functor  sem  is  clearly  faithful  since,  given  /,  /'  :  C  —*•  C  in  £  with 
/  ?£  /',  we  have  (//£)(lc.)  =  /#/'  =  (f'/T)(lC'). 

To  see  that  it  is  full,  let  H  :  Cf/£  — y  C IT-  be  a  functor  such  that 

C/T  - — - *C/£ 

XXX^£ 

commutes.  Then  H  has  to  send  the  object  lc»  to  a  map  /  :  C  — »  C  in  £.  We  claim  that 
H  =  //£.  Indeed,  since  H  commutes  the  triangle,  and  there  is  a  morphism  g  :  lc  — ^ ►  g 
in  C/T  for  any  g  :  C'  — *  V,  we  must  have  a  morphism  H(g)  =  g  :  f  — »  H(g)  in  C/T 
and  therefore  H(g)  =  g  o  f,  so  that  H  and  //£  coincide  on  the  objects.  H  and  //£ 
coincide  trivially  on  the  morphisms,  since  for  any  h  :  g  —*  g1  in  C/T  we  must  have 
H(h)  =  h  =  (//£)(h),  because  both  H  and  //£  commute  the  triangle. 
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The  left  adjoint  str  :  Cat//T  — ►  JJ9  sends  an  object  A  :  A  — *•  X  to  the  object 
A{Ia)  €  Tj  and  a  morphism  H  :  A  -+  B  to  the  morphism  B(h )  :  B(Igj  -+  A(I^)  in 
T,  for  h:  !§_-*  H(Ijj)  the  unique  morphism  in  functoriality  then  follows  from  the 
initiality  of  each  lx  in  its  category  X . 

The  functor  A  :  A  — ►  X  factors  through  as 

A  -S4*  A(/^/X  — ►  X 

with  rjA[X)  =  A(h)  :  A(I^)  —*  A{X),  for  h  :  ->  X  the  unique  morphism  in  A,  and 

rfA(g)  =  A(g)  for  g  :  X  -+Y.  This  yields  our  desired  map  tja  :  A  — ►  aem(str(.A)). 

Let  now  D  :  A  — ►  sem(C)  be  a  morphism  in  Cat/jT  and  assume  that  there  is  a 
morphism  X)  :  sem(str(vl))  — ►  sem(C)  such  that  15  o  tja  =  D.  Then,  since  we  have 
shown  that  sem  is  full  and  faithful,  we  must  have  ~D  =  //X  •  A{Ia)/J_  — ►  C/X  for  a 
unique  /  :  C  — ►  A(l^).  But  the  equation  o  rjA  =  D  forces 

VMI*))  =  25(  w  =  (//r)(W  =  /  =  D(4) 

and  makes  D,  if  it  exists,  unique,  namely,  D  =  D(I^) /X-  Since  D(Ia)/T_  is  the 
identity  on  morphisms,  and  D  sends  g  :  X  — ►  Y  in  A  to  A(g)  :  D(X )  — ►  D(Y)  with 
D(Y)  =  A(g)  o  D(X),  to  show  that  indeed  D(Ia)/X.°  tja  =  D  it  is  enough  to  check  it 
on  the  objects.  But,  for  any  X  €  A>  we  have 

D(X)  =  A(h)  o  L>(4)  =  naW  o  DW  =  (L>(4)/X)M*)), 
for  h  :  — ►  X  the  unique  morphism  in  A •  □ 

Corollary  46  For  any  C  in  X  there  is  a  natural  isomorphism  C  =  atr(sem(C)). 

Proof:  This  follows  directly  from  sem  full  and  faithful  right  adjoint;  see  [47] ,  T^  orem 
IV.3.1.  □ 

The  structure-semantics  adjointness  theorem  makes  clear  why  Lawvere  calls  X  the 
category  of  (abstract)  theories.  We  can  think  of  7[T)  as  the  abstract,  presentation 
independent,  theory  specified  by  the  presentation  T.  Indeed,  we  can  establish  an 
equivalence  relation  among  theories  T,  Vm  €  Thp  by  defining  T  =  V  iff  there  is  an 
isomorphism: 


*(T)/x— s—  nr)/z 

that  is,  T  =  V  iff  the  corresponding  categories  of  models  are  isomorphic  in  a  way  that 
is  consistent  with  their  projection  functors  to  X-  We  then  have, 

Corollary  47  T  =  T  iff  7{T)  S  7{T). 

Proof:  The  “if”  part  is  clear.  For  the  “only  if”  part,  let  T  =  T .  Then  sem(7(T))  = 
sem{7{T)),  and  applying  the  functor  str  we  get 

7{T)  S  str(sem{7{T)))  3  s*r(sem(7(r')))  3  T{T'), 

as  desired.  □ 
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Remarks : 


1.  In  some  instances,  the  class  of  models  is  restricted  by  restricting  the  class  of 
categories  C  on  which  a  model  M  :  7(T)  — ►  C  of  a  theory  T  can  land.  For 
example,  in  the  original  treatment  of  equational  logic  given  by  Lawvere  [45],  the 
category  C  must  be  the  category  Set.  More  generally,  one  could  restrict  the 
class  of  categories  by  requiring  that  they  belong  to  the  image  of  a  functor  V  : 

For  example,  all  topos  models  of  the  polymorphic  lambda  calculus  are 
obtained  by  restricting  the  corresponding  relatively  cartesian  closed  categories  to 
be  toposes,  i.e.,  to  belong  to  the  image  of  the  forgetful  functor  V  :  Toposes  — ► 
RCCCat  [52].  Therefore,  given  a  functor  V  :  W  — ►  X  and  a  categorical  logic 
Z  on  Xi  we  can  define  the  V -restriction  Z\v  of  Z  to  V  as  the  logic  with  same 
entailment  system  as  Z  and  such  that,  for  T  a  theory,  Mod(T)  is  the  “comma 
category”  [47]  7(T)/V  whose  objects  are  pairs  (M  :  7(T)  — ►  V(D),D),  with 
M  :  7(T)  — ►  V(P)  in  X  and  V  G  W.,  and  with  morphisms  E  :  (M  :  7(T )  — ► 
V(D),D)  — *  ( M 1  :  7(T)  — ►  V(P'),P')  morphisms  H  :  D  — *  V  in  W_  such 
that  V(H )  o  M  =  M'.  Satisfaction  is  defined  as  before,  i.e.,  ( M,D )  (=  <p  in 
Z\v  ifi  M  \=  m  Z.  This  notion  of  V-restriction  includes  the  case  when  C  is 
constrained  to  be  just  one  category:  in  that  case,  we  take  as  our  V  the  functor 
from  the  one  morphism  category  1  to  X  that  picks  up  the  category  C.  For  any 
“restriction  functor”  V  there  is  an  associated  map  of  logics  Z  — ►  Z\v,  called  its 
restriction  map.  The  case  when  V  :  W.  — ►  X  has  a  left  adjoint  K  is  particularly 
interesting,  since  then,  the  comma  category  T(T)/V  is  isomorphic  to  the  slice 
category  K[ J(T))/1£.  Therefore,  if  the  unit  map  r j?(T)  -  7(T)  — ♦  V{K{J{T ))) 
is  such  that  T  H  <p  rj?(T)  h=  then  it  follows  easily  that  the  logic  Z\v  is  in 
fact  a  categorical  logic  on  W.. 

2.  The  definition  of  categorical  logic  given  above  is  satisfactory  and  general  for 
the  models.  Such  models  are  functors  of  the  form  M  :  7{T)  — ►  C,  for  T  the 
theory  in  question,  that  satisfy  the  additional  properties  of  morphisms  in  X- 
However,  the  notion  is  too  restrictive  for  homomorphisms.  The  only  homomor- 
phisms  permitted  between  two  models  M  :  7{T)  — ►  C  and  M'  :  7{T)  — +  C 
are  functors  H  :  C  — *  C  in  X  such  that  M'  =  H  o  M.  Consider  the  equational 
logic  case  already  discussed  in  Example  39,  where  ordinary  E-algebras  A  satis¬ 
fying  equations  E  were  placed  in  a  1-1  correspondence  with  product-preserving 
functors  A  :  7(71,  E)  — ►  Set.  Under  such  correspondence,  E-homomorphisms 
/  :  A  — *  B  can  be  put  into  1-1  correspondence  with  natural  transformations 
/  :  A  ==>  B.  Therefore,  in  order  to  give  a  full  account  of  homomorphisms  we 
should  allow  for  natural  transformations  in  our  definition.  The  point  is  that  T_ 
should  be  not  just  a  category,  but  a  2-category  [47],  and  the  forgetful  functor 
X  — *■  Cat  should  be  a  2-functor.  This  leads  to  the  definition  of  a  2 -categorical 
logic.  The  details  of  this  definition  will  be  given  elsewhere. 


6  Axiomatizing  Logic  Programming 

What  does  programming  in  a  logic  mean?  We  can  begin  to  answer  this  question  by 
stating  informally  some  of  the  requirements  that  a  logic  programming  language  should 
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satisfy.  I  call  the  view  represented  below  the  “weak”  view. 


Weak  Logic  Programming.  A  program  P  in  a  logic  programming  lan¬ 
guage  is  a  theory  in  a  logic  £.  After  entering  the  program  P  into  the 
machine,  the  user  can  ask  questions  about  his/her  program.  Such  ques¬ 
tions,  called  queries,  belong  to  a  specified  class  of  sentences  in  the  language 
of  P.  When  the  user  submits  a  query  <p,  if  it  is  the  case  that  <p  is  a  prov¬ 
able  consequence  of  the  axioms  in  P,  then  the  machine  will  return  a  set 
of  answers  justifying  the  truth  of  <p.  We  can  view  each  of  these  answers 
as  different  proofs  of  the  truth  of  99;  such  “proofs”  may  reasonably  omit  a 
good  part  of  the  information  that  a  completely  detailed  proof  would  pro¬ 
vide.  If  the  query  <p  is  not  provable  from  P,  two  things  can  happen:  either 
the  machine  stops  after  a  finite  amount  of  time  with  the  answer  “failure,” 
or  otherwise  the  machine  loops  forever.  Therefore,  two  things  are  made 
equivalent:  computation  in  the  machine,  and  deduction  in  the  logic. 

One  should  of  course  add  that  in  some  pragmatic  sense  the  implementation  in  the 
machine  should  be  reasonably  efficient  so  that  for  a  broad  enough  class  of  applications 
the  language  can  in  fact  be  used  in  practice;  otherwise  such  a  system  should  be  better 
described  as  a  theorem  prover.  We  could  summarize  the  weak  view  with  the  slogan 

Computation  =  Deduction. 

Although  this  view  is  probably  the  most  commonly  held,  I  do  not  take  it  as  primary. 
The  problem  with  it  is  that  it  makes  no  reference  to  the  models  that  the  theory  is  a 
linguistic  device  for.  A  theory  may  in  principle  have  many  models.  However,  when 
solving  a  particular  problem,  such  as  computing  a  numerical  function  or  sorting  a  list 
of  names,  we  usually  have  a  specific  model  in  mind,  such  as  the  integers,  the  real 
numbers,  or  the  set  of  all  sequences  of  expressions  of  a  certain  kind.  Such  a  model  is 
then  the  intended  or  standard  model  of  the  theory,  and  its  conceptual  importance  is 
primary;  the  theory  serves  only  a  secondary  role  as  a  linguistic  device  for  describing 
the  model.  In  the  logic  programming  literature,  the  standard  model  is  referred  to  as 
the  “closed  world”  that  the  program  describes.  In  a  wide  variety  of  cases  this  standard 
model  can  be  characterized  as  an  initial  model. 

Let  us  denote  by  Ip  the  model  intended  by  our  program  P.  In  the  context  of  such 
a  model,  the  meaning  of  a  query  <p  acquires  a  new  significance.  Our  primary  interest 
is  not  in  truths  that  are  generally  valid  for  all  models.  Rather,  our  interest  is  in  the 
facts  that  are  true  about  our  model.  In  other  words,  we  are  primarily  interested  in 
the  satisfaction  of  the  query  <p  by  the  model  Ip  and  only  secondarily  in  the  provability 
of  <p  from  the  axioms  in  P.  The  theory  P  is  a  linguistic  device  through  which  such 
satisfaction  may  be  verified,  since  if  the  query  is  provable,  it  must  be  true  in  all  models 
and  therefore  it  should  be  a  true  fact  about  Ip.  The  most  satisfactory  way  of  exploiting 
provability  as  a  method  of  settling  facts  about  our  model  is  to  restrict  our  attention 
to  queries  for  which,  conversely,  if  the  query  is  true  in  the  intended  model  Ip,  then  it 
is  provable  from  P.  Otherwise,  in  cases  when  the  query  cannot  be  proved,  we  would 
be  left  with  the  doubt  as  to  whether  or  not  it  is  true  in  our  model.  As  we  shall  see, 
this  is  a  widely  exploited  property  that  I  call  “query  completeness.”  It  leads  us  to  the 
following  stronger  requirements  for  a  logic  programming  language, 
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Strong  Logic  Programming.  A  program  P  in  a  logic  programming 
language  is  a  theory  in  a  logic  Z.  The  mathematical  semantics  of  the  pro¬ 
gram  P  is  a  model  Ip  of  the  theory  P  that  is  standard  in  an  adequate 
sense.  After  entering  the  program  P  into  the  machine,  the  user  can  ask 
questions  about  what  properties  hold  in  his/her  model.  Such  questions, 
called  queries ,  belong  to  a  specified  class  of  sentences  in  the  language  of 
P  and  have  the  property  that  for  sentences  <p  in  that  class  the  standard 
model  Ip  satisfies  <p  if  and  only  if  <p  is  provable  from  the  axioms  of  the 
theory  P.  When  the  user  submits  a  query  <p ,  if  it  is  the  case  that  <p  is  a 
provable  consequence  of  the  axioms  in  P,  then  the  machine  will  return  a 
set  of  answers  justifying  the  truth  of  <p.  We  can  view  each  of  these  answers 
as  different  proofs  of  the  the  truth  of  <p\  in  other  words,  the  operational 
semantics  of  the  language  is  given  by  some  proof  theory.  If  the  query  <p  is 
not  provable  from  P  two  things  can  happen:  either  the  machine  stops  after 
a  finite  amount  of  time  with  the  answer  “failure,”  or  otherwise  the  machine 
loops  forever.  Therefore,  three  things  are  made  equivalent:  computation 
in  the  machine,  deduction  in  the  logic,  and  satisfaction  in  the  standard 
model. 

Of  course,  the  efficiency  requirement  applies  exactly  as  before,  and  provides  the 
pragmatic  boundary  between  theorem  proving  and  logic  programming.  We  can  sum¬ 
marize  the  strong  view  of  logic  programming  under  the  slogan 

Computation  =  Deduction  =  Satisfaction  in  the  standard  model. 

I  have  already  mentioned  that  the  weak  view  of  logic  programming,  being  exclu¬ 
sively  proof-theoretic  in  nature,  is  unsatisfactory.  Nevertheless,  weak  logic  program¬ 
ming  seems  to  have  the  advantage  of  having  a  broader  range  of  applicability,  so  that 
it  could  cover  certain  examples  of  logic  programming  languages  for  which  strong  logic 
programming  might  prove  too  restrictive.  However,  we  have  already  seen  in  Proposi¬ 
tion  9  that,  thanks  to  the  generality  of  the  axioms  for  a  logic,  we  can  always  associate 
a  model  theory  to  an  entailment  system  so  that  the  entailment  system  becomes  a  com¬ 
plete  logic  with  initial  models.  This  shows  that  there  is  no  need  for  carrying  along  two 
different  notions.  Surprisingly  enough,  we  can  actually  understand  the  weak  view  of 
logic  programming  not  as  a  broader  notion,  but  rather  as  a  special  case  of  the  strong 
notion,  one  for  which  the  models  are  proof-theoretic  structures.  This  suggests  making 
strong  logic  programming  our  basic  notion.  This  is  a  richer,  conceptually  and  seman¬ 
tically  more  satisfactory  notion,  yet  in  the  sense  just  explained  it  is  the  notion  that  is 
most  broadly  applicable. 

Definition  48  axiomatizes  the  strong  logic  programming  view.  This  definition  is  a 
further  step  in  a  series  of  previous  attempts  by  J.A.  Goguen  and  the  author  to  articu¬ 
late  a  broad  view  of  logic  programming  open  to  many  logics  and  languages.  The  paper 
[32]  presented  this  view  and  used  it  as  a  natural  way  to  unify  two  logic  programming 
language  paradigms,  the  functional  and  the  relational,  by  unifying  their  logics.  It  also 
argued  that  every  program  should  have  an  initial  model  as  its  mathematical  seman¬ 
tics,  and  showed  that  this  was  the  case  for  first  order  functional  programming,  first 
order  relational  programming  with  Horn  clauses,  and  their  unification.  This  -iew  was 
made  formal  in  a  paper  by  J.A.  Goguen  [24]  using  institutions.  Goguen  proposed  that 
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logic  programming  languages  should  have  an  underlying  institution  so  that  the  state¬ 
ments  of  the  language  were  sentences  in  that  institution,  the  operational  semantics 
was  given  by  an  efficient  form  of  deduction  in  that  institution,  and  the  mathematical 
semantics  was  given  by  a  class  of  models,  preferably  initial;  the  paper  [28]  by  Goguen 
and  Buxstall  also  proposed  this  formalization.  The  definition  below  is  very  much  in 
the  same  spirit,  but  it  combines  the  proof-theoretic  and  model-theoretic  aspects  of  the 
issue  using  the  concepts  developed  in  this  paper  to  suggest  two  new  conditions.  One 
is  a  query  completeness  requirement  with  the  explicit  demand  that  what  is  provable 
should  coincide  with  what  is  true  in  the  initial  model;  the  other  is  a  formal  definition 
of  an  operational  semantics  as  an  effective  proof  subcalculus.  Also,  the  use  of  initial 
models  for  the  mathematical  semantics  is  here  made  mandatory. 

Definition  48  A  logic  programming  language  CP  is  a  4-tuple  CP  =  (C,  Signp, stat, 
quer)  with: 

1.  C  =  (Sign, sen, Mod,  (~,^=)  a  logic. 

2.  Signp  a  subcategory  of  Sign. 

3.  stat  :  Sign  — ►  Set  a  subfunctor  of  the  functor  obtained  by  composing  sen 
with  the  finite  powerset  functor,  i.e.,  there  is  a  natural  inclusion  stat(E)  C 
P/in(sen( £))  for  each  E  £  Sign.  Each  T  £  stat(E)  is  called  a  set  of  E -statements 
in  CP.  This  defines  a  subcategory  Th^gt  of  27tp  whose  objects  are  theories  P  = 
(E,T)  with  E  €  Signp  and  T  €  stot(E),  and  with  morphia  ms  axiom-preserving 
theory  morphisms  H  such  that  H  £  Signp.  Each  such  theory  P  £  Thitat  is  called 
a  program  in  CP. 

4.  quer  :  Sign  — ►  Set  a  subfunctor  of  the  sen  functor.  The  sentences  <p  £  quer(E) 
axe  called  the  E-queries  of  CP. 

In  addition,  the  following  properties  are  satisfied: 

Mathematical  Semantics:  Each  program  P  £  Tk^at  has  an  initial  model  Ip.  The 
denotation  function 

Ph  IP 

is  called  the  mathematical  semantics  of  CP. 

Query  Completeness:  For  each  program  P  =  (E,  T)  and  query  <p  £  quer( E)  we 
have 

<p  O  Ip  [=2  < p. 

Operational  Semantics:  There  is  an  effective  proof  subcalculus  of  the  form  0  = 
(Sign,  sen,  h,  Signp,  scrip,  stat,  qucr,P,Pr,ir),  i.e.,  having  ent(C)  as  its  underly¬ 
ing  entailment  system,  Signp  as  its  category  of  admissible  signatures,  stat  as  its 
axioms,  and  quer  as  its  conclusions. 

The  effective  proof  subcalculus  0  is  not  assumed  to  be  unique.  Any  such  0  is 
called  on  operational  semantics  for  the  logic  programming  language  CP.  □ 
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Notice  that,  as  pointed  out  at  the  end  of  Section  3.2,  given  an  operational  semantics 
0  for  a  logic  programming  language  HP,  every  program  P  has  an  associated  partial 
recursive  search  function 

searchp  :  quer(P)  x  IN  — ►  P/in{proofs(P)), 

so  that  we  can  ask  for  as  many  answers  to  a  query  <p  as  we  desire,  and  then  we 
get  back  the  answers  if  they  exist,  or  otherwise  either  information  about  failure  in 
finite  time  or  no  answer  at  all.  One  way  in  which  the  operational  semantics  of  the 
logic  programming  language  can  change  is  by  changing  the  mode  of  computation;  for 
example,  in  a  debugging  mode  answers  should  be  much  more  informative  than  in  a 
standard  mode.  The  axioms  for  an  effective  proof  subcalculus  are  very  flexible;  they 
allow  expressing  different  notions  of  “prooP  suitable  for  different  purposes  as  different 
subcalculi. 

This  finishes  our  axiomatization  of  logic  programming  languages?.  However,  the 
above  definition  has  the  drawback  of  not  taking  into  account  efficiency  considerations. 
In  practice,  we  would  not  be  willing  to  use  a  logic  programming  language  if  answers  to 
queries  were  to  take  an  inordinate  amount  of  time  compared  with  the  time  that  it  would 
take  to  compute  the  solution  to  the  problem  using  a  more  conventional  language.  We 
might  be  willing  to  accept  the  system  implementing  the  language  as  a  theorem  prover, 
but  not  as  a  programming  language.  Therefore,  some  pragmatic  line  must  be  drawn 
between  theorem  provers  and  programming  languages.  It  may  be  impossible  to  settle 
this  issue  once  and  for  all,  for  the  following  reasons: 

Emergence  of  increasingly  more  efficient  operational  semantics:  Linear  res¬ 
olution  made  it  possible  to  develop  interpreters  for  Horn  clause  logic,  and  term 
rewriting  allowed  equations!  logic  programming  interpreters.  Present  compila¬ 
tion  techniques  for  Horn  clauses  and  for  functional  languages  permit  developing 
compilers  that  make  the  efficiency  of  these  languages  entirely  acceptable  com¬ 
pared  with  more  conventional  languages  run  on  the  same  sequential  machines. 

New  models  of  parallel  computation  and  new  architectures:  These  can  dras¬ 
tically  alter  the  mathematical  complexity  of  many  problems  and  make  possible 
computations  that  were  not  feasible  with  previous  technology.  Logic  program¬ 
ming  languages,  thanks  to  their  declarative  character,  can,  in  fact,  play  a  leading 
role  in  the  discovery  of  such  new  models  and  architectures. 

Advances  in  hardware  technology:  For  the  moment,  these  show  a  dramatic 
increase  in  computing  speed  and  a  decrease  in  device  size,  although  the  laws  of 
physics  will  eventually  pose  a  hard  boundary  to  such  advances. 

However,  there  are  intrinsic  complexity  theory  bounds  that  no  technological  advance 
can  reverse.  Therefore,  one  of  the  most  important  tasks  in  logic  programming  is  to 
find  efficient  proof  subcalculi  for  those  entailment  systems  that  have  them. 

In  practice,  we  can  observe  a  migration  process.  First,  certain  logics  exist;  then, 
some  theorem  provers  are  developed  to  mechanize  their  deduction;  and,  finally,  some  of 
these  theorem  prover  techniques  are  found  to  be  efficient  and  give  birth  to  programming 
language  interpreters.  Each  new  language  in  turn  suggests  new  models  of  computation, 
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new  compilers,  and  new  architectures.  In  first  order  logic  programming  this  is  clearly 
the  trend,  and  in  type  theory  a  similar  trend  is  apparent  for  higher  order  logics.  Of 
course,  historical  developments  are  not  logical  necessities,  and  in  the  future  we  may 
find  more  and  more  language  designers  in  the  role  of  producers  rather  than  consumers 
of  new  logics. 

6.1  First  Order  Logic  Programming 

The  logic  programming  ideas  historically  originated  from  the  tradition  of  first  order 
resolution  theorem  proving  [42,72]  and  were  first  embodied  in  the  Prolog  language 
[11].  The  Prolog  culture  has  been  so  successful  that  for  many  researchers  the  part 
— i.e.,  Horn  clause  relational  programming  in  its  different  variants,  or  perhaps  first 
order  logic  programming  for  the  truly  ambitious —  seems  to  become  identified  with 
the  whole.  This  of  course  may  be  styfiing.  A  different  theorem  proving  tradition, 
namely  equational  theorem  proving,  has  existed  alongside  and  provided  term  rewriting 
techniques  that  were  recognized  by  several  researchers  in  the  late  seventies  as  a  very 
.  good  basis  for  designing  and  giving  semantics  to  functional  programming  languages. 
Pioneering  work  in  this  direction  includes  that  of  J.A.  Goguen,  who  created  the  OBJ 
language  [23,25],  and  M.J.  O’Donnell’s  thesis  [55],  In  the  1980’s  it  became  gradually 
apparent  that  these  two  styles  of  first  order  logic  programming,  the  relational  based 
on  Horn  clauses  and  the  functional  based  on  equations,  should  be  unified  and  several 
proposals  emerged.  The  Eqlog  language  [33]  was  the  first  proposal  suggesting  that 
this  unification  could  best  be  achieved  by  unifying  both  logics  into  Horn  clause  logic 
with  equality,  and  giving  an  initial  model  semantics  for  the  resulting  programs. 

What  follows  is  a  discussion  of  a  variety  of  first  order  logic  programming  styles. 
The  emphasis  is  on  the  particular  choices  of  statements,  queries  and  proofs.  I  mention 
some  languages  only  to  give  a  few  examples.  There  are  of  course  many  other  languages 
that  could  be  mentioned,  but  this  is  not  a  survey.  To  simplify  the  exposition,  I  present 
the  ideas  in  an  unsorted  first  order  logic  notation;  however,  all  that  I  say  generalizes 
to  many-sorted  and  order-sorted  first  order  logic. 

Horn  Clause  Logic  Programming 

In  this  case,  the  signatures  are  finite  first  order  signatures,  and  the  sets  of  statements 
are  finite  sets  of  Horn  clauses,  i.e.,  of  sentences  of  the  form 

V£  A  ■<= 

where  A,B\,...,Bn  are  atomic  formulas  that  do  not  involve  an  equality  predicate. 
The  queries  are  existential  sentences  of  the  form 

3x  Ci, ...,  Cf% 

with  Ci,  ...,Cn  atomic  formulas  not  involving  equality.  The  initial  model  of  a  program 
P  =  (£,T)  is  its  Herbrand  model  Tz, r>  whose  functional  part  consists  of  the  term 
algebra  7>  on  the  function  symbols  F  of  £,  and  for  each  n-ary  predicate  symbol  p  and 
f€7?we  have, 

t  e  prE.r  iff  r  p(t). 
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Query  completeness  is  a  direct  consequence  of  Herbrand’s  theorem.  The  standard 
operational  semantics  is  Horn  clause  resolution  [42].  A  proof  of  a  query  3x  C\, ...,  Cn 
for  a  program  P  is  a  substitution  6  such  that  h p  9(Ci), ...,  9{Cn).  Although  Prolog  [11] 
is  the  most  popular  Horn  clause  logic  programming  language,  its  extralogical  features, 
the  incompleteness  of  its  search  strategy  and  its  nonstandard  unification  make  it  fall 
short  of  the  logic  programming  ideal. 

Equational  Logic  Programming 

In  this  case,  signatures  are  finite  functional  signatures,  and  the  sets  of  statements  are 
finite  sets  of  Church-Rosser  and  terminating  equations20.  The  mathematical  semantics 
of  a  program  P  =  (F,  E)  is  given  by  the  initial  algebra  Tp<E  in  the  class  of  all  models 
of  the  theory  (F,F).  For  any  set  E  of  F-equations,  the  initial  algebra  TFiE  has  a  very 
simple  construction  as  the  quotient  algebra  of  the  term  algebra  Tp  by  the  congruence 
relation  =E  defined  by 

t  =E  t'  iff  E  \-p  t  =  V. 


See  the  original  ADJ  paper  [37],  or  the  survey  [53]  for  a  detailed  proof  of  the  initiality 
of  Tp f  =E.  The  operational  semantics  is  term  rewriting,  i.e.,  equational  deduction 
using  the  equations  only  from  left  to  right.  Since  the  equations  are  Church-Rosser  and 
terminating,  each  term  t  rewrites  after  a  finite  number  of  steps  to  a  unique  canonical 
form  canE(t)  that  cannot  be  further  simplified  by  the  equations.  The  term  canE(t)  is 
the  unique  canonical  representative  of  the  =£-equivalence  class  [t].  Therefore,  an  iso¬ 
morphic  (but  computationally  more  intuitive)  representation  of  the  initial  algebra  Tf,e 
can  be  given  as  the  set  CanEiE  whose  elements  are  ground  terms  of  the  form  canE(t); 
this  set  has  an  obvious  F-algebra  structure  making  it  isomorphic  to  Tp/  ~E,  namely 
for  /  €  Fn  and  tu...,tn  6  CanFtE  we  define  ••-,**)  =  canE(f(ti,...,tn)). 

Two  basic  computations  that  can  be  performed  by  term  rewriting  are:  reduction  of 
a  ground  term  t  to  canonical  form,  and  deciding  when  two  ground  terms  t  and  t'  are 
made  equal  by  the  equations.  They  give  rise  to  two  types  of  queries: 

1.  3x  t  =  x 

2.  t  =  t' 

where  the  sentence  3x  t  =  x  is  always  true  (since  we  could  choose  x  to  be  t  itself)  but 
we  are  interested  in  the  most  informative  proof  possible,  namely  one  where  x  is  chosen 
to  be  canE(t).  Therefore,  for  queries  <p  of  type  (l)  query  completeness  is  trivial;  for 
queries  of  type  (2)  it  follows  trivially  from  the  definition  of  =E. 

An  example  of  a  logic  programming  language  of  this  type  based  on  untyped  equa¬ 
tional  logic  is  the  language  of  Hoffmann  and  O’Donnell  [39,54].  The  OB  JO  language 
was  also  untyped  [25],  but  subsequent  versions  have  all  been  typed.  OBJ2  and  OBJ3 
[19,29]  are  based  on  order-sorted  equational  logic  [35],  where  types  can  be  related  by 
a  partial  order  subtype  relation,  e.g.,  Nat  <  Int ,  and  operations  can  have  several 

20 This  condition  can  be  relaxed,  e.g.,  by  dropping  the  termination  property.  O’Donnell  [54]  does  not 
require  termination,  but  imposes  sufficient  conditions  to  ensure  the  Church-Rosser  property.  In  any 
event,  Church-Rosser  and  terminating  equations  are  powerful  enough  to  specify  all  total  computable 
functions  [5|. 
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typings,  e.g.,  natural,  integer,  rational  and  complex  addition,  all  with  the  same  re¬ 
striction  to  subtypes;  this  makes  OBJ  programs  very  expressive.  The  expressiveness 
of  OBJ  programs  is  also  increased  by  the  use  of  conditional  equations,  so  that  the  whole 
discussion  above  should  be  understood  as  taking  place  in  the  context  of  conditional 
equational  logic. 

One  of  the  great  advantages  of  logic  programming  languages  is  that  they  are  declar¬ 
ative  and  make  no  commitments  to  a  particular  execution  sequence.  Therefore,  they 
can  be  used  to  design  and  program  entirely  new  parallel  architectures.  In  the  case 
of  equational  logic  programming,  one  such  architecture  is  the  Rewrite  Rule  Machine 
that  Goguen,  Leinwand,  Winkler,  Aida  and  I  are  building  at  SRI  [36].  Its  model  of 
computation  is  a  new  operational  semantics  for  equational  logic  programming  based 
on  concurrent  term  rewriting  [30]. 

Horn  Clause  Logic  Programing  with  Equality 

Horn  clause  logic  programming  is  relational ,  whereas  equational  logic  programming  is 
functional.  Each  approach  has  its  own,  somewhat  complementary,  strengths.  For  a 
problem  where  searching  is  crucial,  the  relational  approach  is  ideal.  However,  many 
computations  are  functional  in  nature  and  do  not  require  any  search  or  backtracking; 
for  those,  term  rewriting  is  best.  In  [32]  Goguen  and  I  suggested  that  the  functional 
and  relational  approaches  to  logic  programming  could  be  combined  by  combining  the 
corresponding  logics.  The  combination  is  of  course  Horn  clause  logic  with  equality. 
In  this  logic  programming  style,  the  signatures  are  finite  first  order  signatures,  and 
the  sets  of  statements  are  finite  collections  of  Horn  clauses  that  now  may  involve 
the  equality  predicate  and  such  that  the  clauses  for  equality  are  Church-Rosser  and 
terminating  in  a  suitable  sense21.  The  mathematical  semantics  is  an  initial  model 
semantics  that  generalizes  Herbrand  models  and  initial  algebras.  Given  a  first  order 
signature  E  =  (F,  P )  and  a  set  T  of  Horn  clauses  — possibly  involving  equalities —  the 
initial  model  Tzx  has  a  functional  part  consisting  of  the  quotient  algebra  of  the  term 
algebra  7>  under  the  congruence  relation  =r  defined  by 

t  sr  f*  iff  T  l-2  t  =  if 


and  a  relational  part  given  by 

([*i] *-,[*»»])  €  prc, P  iff  T  hh  p(ti,...,t„) 

for  each  p  €  P»  in  P.  See  [34]  for  a  detailed  proof  of  the  initiality  of  Tz,r  in  the 
general  case  of  order-sorted  logic,  that  contains  unsorted  logic  as  a  sublogic.  Queries 
are  existential  sentences  of  the  form 

3x  Ci,...,Cn 

with  Ci, ...,  Cn  atomic  formulas  but  now  some  of  them  can  be  equations.  Query  com¬ 
pleteness  is  a  direct  consequence  of  Herbrand’s  theorem;  see  [34]  for  a  proof  in  the 
general  case  of  order-sorted  logic.  Solving  queries  is  done  in  a  fashion  entirely  similar 
to  ordinary  Horn  clause  resolution;  the  only  difference  is  that  standard  unification  is 

31Since  we  have  Horn  clauses,  the  equations  may  be  conditional  and  even  have  predicates  other  than 
equality  in  their  conditions. 
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replaced  by  unification  modulo  the  equations  of  the  program.  By  the  Church-Rosser 
and  terminating  assumptions,  this  can  be  done  by  some  complete  strategy  for  narrow¬ 
ing  [40] .  As  before,  the  answers  to  queries  are  substitutions  that  make  the  instance  of 
the  query  provable.  The  Eqlog  language  [32]  is  an  example  of  a  language  in  this  style 
of  logic  programming;  its  logic  is  order-sorted  Horn  clause  logic  with  equality.  Other 
approaches  encode  predicates  as  functions  [18,15]  or  functions  as  predicates  [71]. 

Logic  Programming  in  other  Fragments  of  First  Order  Logic 

Although  Horn  clause  logic  with  equality  is  in  a  sense  the  end  of  the  road  for  fragments 
of  first  order  logic  admitting  initial  models  [48],  we  can  still  view  the  kind  of  weak 
logic  programming  that  is  possible  in  any  fragment  of  first  order  logic  as  an  instance 
of  strong  logic  programming  for  a  logic  £'  that  has  the  same  entailment  system  as 
first  order  logic,  but  has  a  different  underlying  institution  admitting  initial  models. 
There  are  several  such  possible  institutions.  One  is  given  by  Theorem  9,  which  in  a 
sense  is  the  ideal  model  theory  for  people  with  a  strong  proof-theoretic  bias;  another 
such  institution  could  have  hyperdoctrine  models  such  as  “logical  categories”  [73].  As 
a  model  theory,  such  choices  seem  preferable  to  standard  first  order  logic  where  one 
would  have  to  worry  about  many  different  models,  none  of  them  initial. 

6.2  Higher  Order  Logic  Programming 

Although  type  theory  and  logic  programming  share  a  vital  connection  with  logic,  they 
have  developed  in  relative  isolation  from  each  other.  Much  can  be  gained  at  both 
the  conceptual  and  practical  levels  from  an  attempt  to  understand  the  relationships 
between  these  two  fields.  Conceptually,  logic  programming  can  be  saved  from  becoming 
parochial  and  losing  important  new  opportunities,  and  type  theory  may  gain  new  logic 
and  model-theoretic  insights.  At  the  practical  level,  what  can  be  gained  is  a  much 
better  understanding  of  how  to  design  powerful  new  languages  that  integrate  such 
features  as  generic  modules,  higher  order  functions,  logical  variables  and  subtypes, 
and  yet  have  a  clear  and  rigorous  semantics  based  on  logic.  Categorical  logic  plays  a 
key  role  in  relating  type  theory  and  logic  programming,  so  that  functional  languages 
based  on  type  theory  can  be  understood  as  logic  programming  languages  in  the  strong 
sense  of  our  axiomatization. 

One  of  the  greatest  strengths  of  categorical  logics  is  that  they  unify  proof  theory 
and  model  theory  in  a  particularly  illuminating  way.  Given  a  theory  T,  the  associated 
free  category  7{T)  is  at  the  same  time  an  abstract  theory,  providing  a  notion  of 
equivalence  of  proofs,  and  the  initial  model  of  the  theory.  The  logic  programming 
axioms  underscore  the  importance  of  these  free  or  “term”  models,  because  in  them 
mathematical  and  operational  semantics  are  interlocked  as  two  aspects  of  the  same 
reality.  Being  initial,  these  models  are  in  a  sense  the  most  general,  and  being  generated 
by  the  rules  of  the  logic,  they  wear  their  operational  semantics  on  their  sleeves. 

Type  theories  can  be  used  to  define  programming  languages  with  powerful  type 
mechanisms.  They  can  also  be  used  as  formal  frameworks  to  reason  about  programs 
or  to  automatically  generate  correct  programs  from  their  specifications.  Although,  of 
course,  it  is  one  of  the  great  advantages  of  type  theory  that  formal  reasoning  about 
a  program  written  in  it  can  be  carried  out  in  the  type  theory  itself,  for  the  purposes 
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of  this  paper,  I  concentrate  on  the  first  use.  I  consider  three  well-known  examples 
of  type  theory:  the  typed  lambda  calculus,  the  G irard- Reynolds  polymorphic  lambda 
calculus,  and  Martin-Lof  type  theory.  All  of  these  calculi  are  higher  order  equational 
logics,  and  all  have  a  notion  of  reduction  entirely  similar  to  term  rewriting  in  first 
order  equational  logic.  Therefore,  in  all  three  cases  programs  are  finitary  equational 
theories  whose  equations  are  Church- Rosser  and  terminating  in  the  given  calculus,  and 
operational  semantics  is  given  by  reduction  to  canonical  form.  Queries  are  similar  to 
those  of  the  first  order  equational  case,  i.e.,  they  are  sentences  of  the  form: 

1.  3x  t  =  x 

2.  t  =  t> 

with  t  and  f  terms  in  the  appropriate  syntax23. 

The  Typed  Lambda  Calculus 

The  typed  lambda  calculus  is  a  categorical  logic  on  CCCat.  the  category  of  cartesian 
closed  categories.  The  mathematical  semantics  of  a  program  P  =  (E,P)  is  given  by 
its  initial  model  1/(e^b),  where  /(E,  E)  is  the  free  cartesian  closed  category  generated 
by  P  —  (E ,E)  (see  [43]  1.10-11  for  a  detailed  description  of  such  theories,  called  there 
“typed  lambda  calculi"  and  the  /(E  ,E)  construction;  note  that  they  assume  a  weak 
natural  numbers  object  in  their  theories,  but  this  is  not  an  essential  requirement). 
Query  completeness  is  trivially  satisfied.  A  particularly  elegant  operational  semantics 
is  provided  by  Curien’s  categorical  combinators  [13]. 

The  ML  language  [38]  is  closely  related  to  the  typed  lambda  calculus,  but  it  is 
instead  based  on  the  polymorphic  lambda  calculus;  expressions  in  general  do  not  have 
one  type,  but  a  family  of  types  that  are  the  instances  of  a  unique  type  expression 
involving  type  variables.  Categorical  combinators  can  be  used  to  provide  a  simple  and 
efficient  ML  implementation  [12]. 

The  Girard-Reynolds  Second  Order  Polymorphic  Lambda  Calculus 

This  calculus,  proposed  by  Girard  [21]  and  discovered  independently  by  Reynolds 
[60],  is  more  powerful  than  the  usual  polymorphic  lambda  calculus;  it  allows  universal 
quantification  of  type  variables  inside  type  formulas.  There  are  two  possible  categorical 
semantics.  One,  based  on  a  type  of  hyperdoctrines  called  PL-categories,  was  given 
by  Seely  [63];  the  other,  called  the  “universe  model  semantics,”  is  based  on  relatively 
cartesian  closed  categories  and  was  proposed  in  [52].  These  two  categorical  semantics 
are  related  by  a  map  of  logics  that  is  the  identity  on  the  underlying  entailment  system 
and  that  for  each  theory  T  provides  a  forgetful  functor  from  its  universe  models  to 
its  PL- category  models.  The  PL- category  models  provide  only  names  for  the  types, 
but  not  the  types  themselves;  the  universe  models,  however,  provide  type  extensions 
as  objects  of  a  category.  Again,  query  completeness  is  trivial  and  the  operational 
semantics  is  given  by  reduction  of  expressions  to  normal  form. 

33In  some  cases,  for  example  in  Martin-Lof  type  theory,  we  can  have  similar  queries  for  type  expres¬ 
sion t,  for  which  there  is  also  a  normal  form. 
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From  the  point  of  view  of  applications,  there  are  compelling  reasons  to  extend  the 
second  order  polymorphic  lambda  calculus  in  various  ways.  Although  all  the  provably 
terminating  functions  can  be  expressed  in  the  basic  calculus,  not  all  the  algorithms  for 
computing  a  function  of  this  kind  are  expressible,  and  therefore  it  is  more  expressive 
to  have  full  recursion.  Also,  to  do  programming-in-the-large,  it  is  very  useful  to  be 
able  to  compute  with  modules  as  values,  i.e.,  to  assume  that  Type:Type.  Extensions 
of  the  universe  model  semantics  that  provide  a  categorical  semantics  for  the  second 
order  polymorphic  lambda  calculus  with  full  recursion  and/or  with  Type:Type  are 
given  in  [52];  for  Type:Type,  a  categorical  semantics  was  given  in  [70];  proof  rules  for 
extensions  of  this  kind  are  given  in  [8].  The  language  Pebble  [7]  has  full  recursion, 
Type:Type  as  well  as  other  features;  the  Quest  language  [9]  is  also  in  this  category, 
and  in  addition  provides  subtypes. 

Mart  in- L  of  Type  Theory 

Martin-Lof  type  theory  [49,50]  provides  powerful  type  constructions  with  both  uni¬ 
versal  and  existential  quantification,  equality  types,  etc.  Normalization  of  expressions 
is  Church-Rosser  and  terminating  both  for  terms  and  for  type  expressions  and  pro¬ 
vides  the  operational  semantics.  The  work  of  Seely  [65]  has  shown  how  Martin-Lof 
type  theory  can  be  viewed  as  a  categorical  logic  on  LCCCat.  the  category  of  locally 
cartesian  closed  categories.  If  equality  types  are  dropped,  the  categorical  semantics 
can  be  broadened  in  several  closely  related  ways  such  as  contextual  categories  [10],  or 
relatively  cartesian  closed  categories  [41]. 

7  Concluding  Remarks 

The  main  focus  of  this  paper  has  been  on  basic  concepts  and  definitions.  Once  the 
basic  framework  is  set  up,  general  results  about  logics  satisfying  some  additional  con¬ 
ditions  should  be  investigated.  Results  of  this  nature  are  obtained  in  the  context  of 
traditional  set-theoretic  structures  by  the  methods  of  abstract  model  theory  [2],  and 
several  such  results  have  already  been  obtained  for  general  institutions  by  Tarlecki 
[67,68]  and  by  Sannella  and  Tarlecki  [61].  Some  of  the  additional  conditions  that  one 
may  want  to  impose  have  to  do  with  properties  of  the  category  of  signatures  and  the 
functor  of  sentences  that  cam  have  a  natural  formulation  in  terms  of  the  “charters” 
and  “parchments”  of  Goguen  and  Burstall  [28] . 

The  satisfaction  relation  between  sentences  and  models  can  be  seen  as  a  charac¬ 
teristic  function  taking  the  value  true  or  false.  B.  Mayoh  [51]  suggested  interesting 
applications  in  which  one  would  like  to  broaden  the  notion  of  “truth  value”  and  pro¬ 
posed  a  generalization  of  institutions  called  galleries.  Goguen  and  Burstall  [28]  gave 
a  nice  categorical  formulation  of  institutions  as  a  pair  of  functors  together  with  an 
extranatural  transformation  or  “wedge”  [47]  involving  the  truth  value  category  2  with 
objects  true  and  false  and  only  one  nonidentity  morphism  from  false  to  true ;  they  then 
suggested  a  notion  of  a  generalized  institution  that  can  be  obtained  replacing  2  by  a 
category  M  of  truth  values  in  their  categorical  formulation.  They  argued  that  Mayoh’s 
galleries  could,  after  some  modifications,  be  seen  as  generalized  institutions.  A  sub¬ 
stantial  portion  of  the  present  theory  could  have  been  developed  in  the  more  general 
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setup  of  a  truth  value  category  other  than  2,  and  the  implications  of  that  possibility  are 
an  interesting  topic  of  future  research.  However,  having  the  notion  of  a  proof  calculus 
available  may  decrease  the  need  for  the  extra  generality.  What  is  needed  is  a  study  of 
examples  to  see  how  naturally  they  can  be  expressed  in  the  different  frameworks.  The 
ideas  of  Poigne  [58],  who  has  proposed  another  way  of  generalizing  institutions  should 
also  be  taken  into  account. 

The  study  of  mappings  between  the  different  logical  structures  should  be  further 
developed.  Those  mappings  consist  of  a  functor  relating  the  two  categories  of  theories 
and  of  one  or  more  natural  transformations;  for  each  of  those  transformations  one 
could  in  principle  choose  between  a  “forward”  and  a  “backward”  direction.  However, 
not  all  possible  combinations  may  behave  well  or  have  interesting  examples.  I  have 
presented  particular  choices  that  seemed  natural,  had  interesting  examples  and  per¬ 
mitted  defining  a  notion  of  logical  substructure.  However,  such  choices  should  not 
exclude  other  possibilities  that  may  be  equally  useful.  More  experience  with  examples 
is  needed  to  ascertain  what  choices  should  be  favored;  computer  science  can  provide  a 
rich  source  of  examples  and  applications. 

Computer  science  applications  have  provided  the  original  stimulus  for  the  develop¬ 
ment  of  the  theory;  with  the  basic  concepts  now  in  place,  one  of  the  main  tasks  ahead 
is  to  bring  the  abstract  concepts  to  bear  on  specific  problem  areas  within  computer 
science.  For  logic  programming,  Section  6  has  given  just  the  beginnings  of  an  applica¬ 
tion,  but  several  other  important  issues,  such  as  compilation,  or  the  use  of  mappings 
between  logics  to  design  new  programming  languages  with  more  powerful  features  have 
not  been  discussed.  Type  theory  and  concurrency  are  also  areas  where  many  applica¬ 
tions  are  possible;  the  paper  [52]  gives  a  particular  type  theory  application,  and  the 
joint  paper  [56]  with  N.  Marti-Oliet  gives  an  application  to  concurrency.  Applications 
to  other  computer  science  areas  such  as  artificial  intelligence  and  automated  deduction 
also  seem  very  natural.  I  hope  that  other  researchers  will  find  the  methods  useful  and 
will  undertake  many  of  those  applications  themselves. 
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